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Preface 


This text is the product of several years’ effort to develop a course to fill a 
specific educational gap. It is our belief that computer science students should 
know how a computer works, particularly in light of rapidly changing tech- 
nologies. The text was designed for computer science students who have a 
calculus background but have not necessarily taken prior physics courses. 
However, it is clearly not limited to these students. Anyone who has had 
first-year physics can start with Chapter 17. This includes all science and 
engineering students who would like a survey course of the ideas, theories, 
and experiments that made our modern electronics age possible. 

This textbook is meant to be used in a two-semester sequence. Chapters 
1 through 16 can be covered during the first semester, and Chapters 17 through 
28 in the second semester. At Queens College, where preliminary drafts have 
been used, the material is presented in three lecture periods (50 minutes each) 
and one recitation period per week, 15 weeks per semester. The lecture and 
recitation are complemented by a two-hour laboratory period per week for 
the first semester and a two-hour laboratory period biweekly for the second 
semester. 

Computer Science students are usually required to take one year of phys- 
ics; therefore, we examined conventional first-year physics courses with care 
so as to delete topics not relevant to the physics of semiconductors and logic 
circuits (such as Bernoulli’s equation or nuclear decay rates). To achieve this 
contraction with some precision, we developed the second semester, Chapters 
17-28, and taught these chapters for two years before writing Chapters 1-16 
for the first semester. 7 

The first-semester chapters include the First Principles of physics. The 
student normally develops the concepts of sliding and colliding blocks because 
these can be readily visualized and the principles understood from common 
experience. These concepts are later applied to electrons and holes, which 
are not readily visualized; hence the early foundation permits the understand- 
ing of more complex conceptual models. 

The second part of this textbook begins with a presentation of some of 
the phenomena that led to the breakdown of classical physics and the advent 
of the quantum in blackbody radiation, photoelectric effect, electromagnetic 
spectrum of atoms, and so forth (Chapters 17 and 18). After laying down the 
fundamental principles of wave mechanics, de Broglie’s hypothesis, and the 
Uncertainty Principle, we introduce the student to the Schrédinger theory of 
quantum physics. This theory is applied to the simple example of the infinite 
potential well (Chapters 19 and 20). An outline of the solution of the Schré- 
dinger equation (i.e., the three quantum numbers n, |, m;) and some of the 
features of the wavefunctions are presented in Chapter 21. After investigating 


yi PREFACE 


the need for the electron spin, we use these results, together with Pauli’s 
exclusion principle, to determine the electron configurations and some of the 
properties of multi-electron atoms. 

The student is then ready to study the electrical properties of solids. A 
brief discussion of the crystal structure and bonding mechanisms (Chapter 
22) precedes the presentation of the classical and quantum free electron the- 
ories and their successes and shortcomings (Chapter 23). In order to explain 
the large differences in the electrical properties of solids as well as the peculiar 
properties of semiconductors, the existence of allowed and forbidden energy 
bands is investigated (Chapter 24). In this chapter, we introduce the concepts 
of the electron effective mass and of holes. Intrinsic and doped semiconduc- 
tors, their electron and hole densities, and their electrical properties are dis- 
cussed in Chapter 25. 

It is now a rather simple matter for the student to understand the behavior 
and the characteristics of semiconductor devices: diodes, bipolar transistors, 
field effect transistors, etc. Semiconductor devices are the subject of Chapter 
26. The text concludes with two chapters unique to this physics textbook. In 
Chapter 27, we show how diodes and transistors can be used to construct 
the logic circuits (gates) that constitute the fundamental building blocks of 
the computer. Chapter 28 is a layman’s introduction to some of the techniques 
used in the fabrication of integrated circuits. 

The laboratory experiments for the first semester are standard in any 
physics department, and thus we do not feel that is is necessary to include 
them in this book. The seven experiments in the second semester have been 
designed to illustrate important measurements; they are not standard in most 
physics courses. The equipment needed for these experiments is not generally 
available in quantity in physics departments but can be ordered from standard 
suppliers or constructed in school shops. For this reason the seven laboratory 
experiments are described in the instructor’s manual that complements this 
textbook. 

Although some of the topics and the level of treatment of Chapters 20- 
27 may appear to be potentially formidable for a first course in physics, in 
practice they have proved not to be so. We have tried to soften the impact 
of mathematical complexities with intuitive physical arguments. For example, 
the existence of energy bands in solids is first introduced by outlining the 
solution of the Schrédinger equation for an electron moving in the periodic 
potential of the ions. The student is shown that the imposition of the boundary 
conditions leads to a transcendental equation for the dispersion relation whose 
numerical solution leads in turn to the existence of forbidden and allowed 
energy bands. (Students are encouraged to use their computer programming 
skills to solve this equation.) This presentation is followed by an alternative 
approach that relies on simple intuitive arguments; here, the mathematics is 
kept to a minimum. The student is shown how, when two atoms are brought 
together to form a molecule, two separate energy levels are formed from each 
level of the individual atom. This is then extended to a situation where a large 
number of atoms come together to form a solid. It is thus relatively simple 


to show that each atomic level becomes a band of very closely spaced energy 
levels. 

By blending mathematical and physical arguments, we believe we have 
achieved a thorough presentation of the concepts of solid state physics while 
staying within the reach of students with an intermediate background in 
mathematics. 


Narciso Garcia 


Arthur C. Damask 
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CHAPTER 1 


Physical Quantities 


2 PHYSICAL QUANTITIES 
1.1 INTRODUCTION 


People have always observed natural phenomena and then verbalized their 
observations for discussion with others. With the development of physics, the 
words in the verbal descriptions were given symbols, which could then be 
manipulated according to the rules of mathematics. 

In physics two fundamental processes are involved. The first is the de- 
scription of natural phenomena based on experiments, which control vari- 
ables. Theories are not accepted by physicists until verified by experiment. 
The second is mathematical manipulation or theorizing, which is a predictive 
process. If the observed quantities have been described properly and given 
the proper symbols, then the subsequent mathematical manipulations will 
result in new relationships that must be correct on testing, else their for- 
mulation will be rejected as incorrect or inadequate. Furthermore, the results 
of the relationships must stand the test of time. In this sense, time means 
enough time for many experiments to be performed to test the relationships. 
The laws presented in this book have met these requirements. 

Although human beings have observed Nature from their first existence, 
it was not until the time of Galileo (1564-1642) that these observations began 
to be expressed in modern mathematical terms. Subsequent studies, mea- 
surements, and critical evaluations developed what we now call the First 
Principles of physics, which have truly stood the test of time. The first six 
chapters of this book discuss these principles and show how they are used 
in their simplest form. Other areas of physics must satisfy these First Principles. 
Chapters 7 through 16 illustrate the use of these principles in rotational mo- 
tion, the behavior of gases, and electric and magnetic phenomena. Chapters 
1 through 16 constitute what is usually known as classical physics. In the 
remaining chapters, we introduce a different way of describing the behavior 
of small physical particles. For example, although we may continue to consider 
the electron as the smallest negatively charged particle, experiments have 
shown that its behavior can also be described as a wave instead of a particle. 
The mathematics of waves, instead of that of particles, must be used to explain 
the electron’s behavior in certain situations, whereas the mathematics of par- 
ticles still applies in other situations. This revolution in thought, begun in 
the early part of this century, has led to the method, or science, of wave 
mechanics, which is more generally called quantum mechanics. 

When the behavior of an electron within a solid is sought, very little can 
be learned by the particle treatment, but a vast amount of understanding can 
be achieved by the wave approach. How does this fit in with our mention of 
the test of time and observation? Although the actual length of time of this 
modern model (about 80 yr) is short compared with the time since Galileo, 
the number of experiments that have been performed is far greater. It has 
been said that of all the physicists who have ever lived, 95% are still alive. 

The observation requirement is somewhat more subtle. We cannot ob- 
serve fundamental particles such as an electron in the same way that we 
observe macroscopic objects; they are too small. In fact, we will later discuss 


the Uncertainty Princtple, which will show that the mere act of observing will 
change some state of the particle. Because of this principle, experiments that 
would completely characterize a small particle are too difficult to perform. 
What we do observe is the statistical behavior of a vast number of particles, 
and we infer the average behavior of a member of the statistical ensemble. 
Therefore, although we may never know the physical parameters of the in- 
dividual particle, there are many physical experiments that can tell us if the 
statistical model is satisfactory. 


1.2 QUANTITIES AND UNITS 


If a physical phenomenon is to be quantified, there must be suitable, agreed- 
on measuring devices. Many measuring systems have been created in the 
past, none perfect. It is desirable to have a measurement system that has the 
least number of fundamental parameters. In classical physics, these are length, 
mass, and time. All other quantities can be defined in terms of these three. 
For example, speed is the ratio of a length to a time. We then choose a standard 
for each of the fundamental parameters. Most scientific measurements use 
the metric system. There are two versions of the metric system in use, the 
cgs (centimeter, gram, second) and the mks (meter, kilogram, second). Al- 
though the cgs system is still often used in the biological sciences, most 
measurements by physicists now use the mks system. This is the mechanical 
part of the more general SI (Systeme International) that covers all physical 
measurements. The English system of units (foot, pound, second) is used 
largely by engineers. We will mostly use SI units in this book. 

The unit of a quantity is as important as the magnitude, as indicated in 
the quote from The New York Times at the beginning of this chapter. It is 
meaningless to say “the distance between two points is 10,” because the 10 
may be meters, miles, or inches. The units are an integral part of the mea- 
surement and must be treated algebraically. One may substitute for them or 
convert them to a different system, but they cannot be gotten rid of except 
by an algebraic process. For example, 7 is dimensionless, but it is defined on 
the basis of two measured quantities, the circumference of a circle divided 
by its diameter. It is independent of units because they cancel: circumference 
(meters)/diameter (meters), and it is seen that meters cancel algebraically. 

Conversion of units must be done with care, and, in order to convert, a 
relationship between units of two different systems must be known. Let us 
illustrate this with a trivial example. How many feet are there in 5 mi? 


5 mi = ? ft (1.1) 
We know the relation 
5280 ft = 1 mi (1.2) 


How do we substitute this with care? The safest rule is always to multiply 
by one (unity), because we know that in algebra, multiplication of anything 
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United States copy of the original 
platinum-iridium bar which for 
many years was the standard of 
length: the meter. 


4 PHYSICAL QUANTITIES 


by unity leaves it unchanged. Our conversion relation, Eq. 1.2, can be made 
into two different forms of unity depending on whether we divide both sides 
by miles or by feet: 


5280 ft Lmi _ 
lmi 5280 ft 


We can now multiply the left side of Eq. 1.1 by the first form of unity, which 
algebraically results in the unit 


mift 


mi 
Multiplication by this gives 
5280 ft 
1 yet 


We see that miles in both the numerator and the denominator have cancelled 
algebraically, leaving feet as the unit. 

We can do more than one algebraic step at a time. For example, how 
many seconds are in 1 day? 


5 mi = 5 mt x = 5 xX 5280 ft = 26,400 ft 


1 day = ? sec 
We know three conversion relations 
1 day = 24h 1h = 60 min 1 min = 60 sec 
We select our choices of unity to give successive algebraic cancellations 


y= (28) (at) (a 


= 24 x 60 x 60 sec = 86,400 sec 


We may convert two units simultaneously in a single equation to save a lot 
of writing. For example, a car traveling at 60 mi/h travels how many feet per 
second? 


ft/sec = 88 ft/sec 


60 = 60 BE 5280 ft 1K lmint\ 60 x 5280 
7 1 mt / \60 mit] \60sec/ 60 x 60 


h 

Remember that in square or cubic units, all measurements must be in 
the same units. It makes no sense to calculate the area of a room if its length 
is measured in feet and its width is measured in meters. Both measurements 
should be in the same system. Also remember that when converting square 
or cubic units they must be squared or cubed just as algebraic quantities. For 
example, how many cubic centimeters (1 m = 100 cm) are there in a volume 
of 1 m°? 


100 cm 100 cm 100 cm 
1m? = 3 = 1,000,00 3 
me = ye (SPER) (AP) (Ap) = 1,000,000 om 


1.3 POWERS OF 10 


Often, very large and very small numbers arise in physics. In 1 cm” of a solid 
there are a vast number of atoms, about 1 followed by 21 zeros. Measurements 
have shown that the range of atomic diameters in meters is between 0.0000000001 
and 0.0000000003 m. Because of the difficulty of reading and writing numbers 
with many zeros, we use powers of 10 notation. Recall the algebraic postulate 
that any quantity to the zeroth power is, identically, unity. A brief table of 
some powers of 10 follows. 


10° = 1 107? = io = 0.1 
10° = 10 10-2? = zo = 0.01 
10? = 100 10-2 = zoo = 0.001 
10° = 1000 10-* = zoo00 = 0.0001 
10* = 10,000 


Some of these are used as prefixes; for example, 10° = kilo, 107° = milli, 
10~© = micro, 10°? = nano, 10° = pico. 

Let us review the algebra of adding and multiplying powers with the 
letter a representing 10. (These rules apply for a equal to any value other than 
Zero.) 


Example 1-1 
(b x a"\(c X a™) = bea"a” = bea"™™ 


Substitute arbitrary numbers for the letters; for example, b = 2, c = 3,n = 
4,andm = 2. 


(2 x 104)(3 x 102) = 2 x 3 x 10* x 107 = 6 x 10° 


Example 1-2 


_ © gn-m 
fxa"™  f 
Ife = 6,f =2,n = 4,andm = 2, 
6x 10* 6 


7x 102 > 5 lot = 3 x 10° 
Example 1-3 
(b x a") + (c X a™) 
This form can be simplified if n = m, then 
(b x a") + (c X a") = (b + c) a” 
If n # m, they can be made equal. For example, let 


b=2andc=3, n=4andm=5 
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substituting in the algebraic relation, we have 
2 x 10* + 3 x 10° 
But 10° = 10’ x 10* and therefore 


2x 10¢ +3 x 10° = 2 x 10? + 3 x 10! x 101 
= (2 + 30)10* = 32 x 10* = 3.2 x 10° 


This sum could also be done by converting the first term 


1 


10 
2x 7 x 1h + 3 x 10° = 0.2 x 10° + 3 x 10° = 3.2 x 10 


In scientific notation, usually only one digit is placed in front of the 
decimal point. 


1.4 ACCURACY OF NUMBERS 


Suppose we wish to find the area of a rectangular surface. We know that we 
multiply the length | by the width w. Suppose we take a metric ruler to 
measure / and w. The metric ruler’s smallest division is the millimeter, 10~3 
m. Figure 1-1 illustrates the use of a metric ruler to measure the length of the 
rectangle. We can see that the measure of length lies between 47.6 and 47.7 
cm; from the position of the edge of the rectangle we can estimate the second 
decimal as being less than 0.5 mm but not less than 0.3 mm. We can therefore 
express our measurement as 47.64 + 0.01 cm, or 0.4764 + 0.0001 m. Thus, 
the last digit is always uncertain and the + value is the magnitude of the 
uncertainty. Suppose we have measured the width as 0.6343 + 0.0001 m; 
What is the accuracy of the calculation of the area? We examine the two 
extremes. The largest area is 


0.4765 m X 0.6344 m = 0.3023 m2 
and the smallest is 


0.4763 m x 0.6342 m = 0.3021 m2 


We can write the answer as the average between the two values with + the 
uncertainty, or 0.3022 + 0.0001 m?*. Therefore, the accuracy of the product 
cannot exceed the accuracy of any of the components in the product. 

This type of analysis must be extended to all data-handling, for example, 
sums, differences and quotients. Suppose that we want to sum two numbers 
known with different degrees of accuracy. What is the accuracy of the sum? 
As an example, consider three successive points A, B, and C on a straight 
line. If the distance between A and B is 15.75 m and the distance between B 
and C is 2.432 m, what is the distance between A and C? The answer is 
obtained by summing the two numbers. If we use a calculator to evaluate the 
sum, it will tell us that the answer is 18.182 m. However, the only reliable 
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answer is 18.18 m, because we do not know what the third digit after the 
decimal point of the first number (15.75 m) is. No matter how accurately a 
given parameter is measured, when it is combined arithmetically with another 
measurement the result is only as accurate as the least-accurate measurement. 
The number of accurate figures in a measurement is called the number of 
significant figures. Computer science students are often misled because com- 
puter or calculator answers may have 8 to 10 figures. In general, most of these 
figures have no significance and the answer should be rounded off to the 
lowest number of significant figures of the quantities used in the calculation. 
It is incorrect to give an answer with a greater number of figures. 


PROBLEMS 


1.1 Express your height in meters, using the rela- 
tion 1 in. = 2.54 cm. 


1.2 Use the relations 1 mi = 5280 ft and 1 m = 
3.28 ft to express the speed 60 mi/h in meters per 
second (m/sec). 


1.3 How many kilometers are in 1 mi? 


1.4 Express the following in scientific notation (a 
single digit to the left of the decimal): 


0.038, 0.000042, 5280, 62.356, 
(4 x 103 + 3 x 107)6 x 1073, 
3.2 x 1046.1 x 10-2 + 9.2 x 1073) 


1.5 Express the following operation in scientific no- 
tation 


4 x 107 + 6 x 10° 
2x 1074 


1.6 Light travels at 186,000 mi/sec. Assume an av- 
erage of 365 days in a year. (a) How many years 
does it take the light to reach us from the sun, which 
is 9.3 X 10’ mi from the earth? (b) How many years 
does it take the light to reach us from the nearest 
star, other than the sun, which is 1.8 x 10'° mi from 
us? 


1.7 Astronomers measure large distances in a unit 
called the light-year. This is the distance that light 
traveling at approximately 186,000 mi/sec will travel 
in 1 yr. How many miles in 1 light-year? 


1.8 Assume that the average lecture period is 1 
microcentury (10~° centuries); how long is the lec- 
ture period in minutes? 

(Answer: 52.6 min.) 


1.9 A light-fermi is a unit of time proposed by sci- 
ence-fiction writer Isaac Asimov. It is defined as the 
time taken by light to travel the distance of 1 fermi 
(10~*° m), which is the approximate size of the pro- 
ton. How long is a light-fermi in seconds? Light trav- 
els at3 x 10° m/sec. 


1.10 There are approximately 8 x 107° copper 
atoms in 1 m° of copper. (a) What is the volume 
occupied by a copper atom? (b) What is the radius 
of a sphere having that volume? (volume of a 
sphere = 4 mr°/3) 


1.11 Assume that atoms have spherical shape with 
average radius 4 x 10~'° m. How many atoms are 
there in the earth? Neglect the volume lost in pack- 
ing the spheres and take the average radius of the 
earth to be 6.37 x 10° m. 

(Answer: 4.04 x 10*°.) 


1.12 In the Old Testament the Lord commanded 
Noah to build an ark 300 cubits long, 50 cubits wide, 
and 30 cubits high. A cubit is the length from a man’s 
elbow to the tip of his extended middle finger. We 
do not know Noah’s height, so measure a cubit from 
both a short person and a tall person. Assume the 
ark was a parallelepiped with right angles. (a) What 
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is the maximum and the minimum values of its vol- 
ume? (b) Assuming that the average animal required 
a space of 2 x 4 x 6ft*, and that one half the volume 
of the ark was for food and passengers, what is the 
possible variation in the number of animals that could 
be accommodated? 


1.13 Density is defined as the mass per unit vol- 
ume. Take the average density of the earth to be 5.5 
g/cm? and assume that the earth is a sphere of radius 
6.37 X 10° km. Calculate the mass of the earth. 
(Answer: 5.96 x 10** kg.) 


1.14 A neutron is one of the constituent particles 
of the nucleus. The mass of the neutron is 1.67 x 
10~*” kg. Assuming that the neutron is a sphere of 
radius 1 F (10~* m), what is the density of the neu- 
tron in g/cm°? Compare your answer with the av- 
erage density of the earth (see problem 1.13). 


1.15 The radius of a carbon atom is about 2.5 x 
107° cm. (a) How many could fit in a row 1-cm long? 
(b) How many could fit in a layer one atom deep 
and area 1 cm?? (c) How many could fit in a cube 1 
cm on each size? (d) If a crystal of carbon atoms 
(diamond) had this form, what is the minimum 
number of impurity atoms that could block the light 


coming through the faces of a 1-cm° cube? Express 
your answer in both percent and in parts per million. 
(Hint: Assume as an approximation that a layer of 
impurity atoms on each of three faces of the cube at 
right angles to each other could block all the light.) 
(Answer: (a) 2 X 10’ cm™!, (b) 4 x 10** cm~?, 

(c)8 x 107 cm, (d) 1.2 x 10%, 1.5 x 10°° %, 
0.15 ppm.) 


1.16 The distance x of an object from a certain or- 
igin is found to vary with time t as x = a, + dot + 
a;t7, where x is in meters, ¢ is in seconds, and 4, a, 
and a3 are constants. What are the units of a1, do, a3? 


1.17. A student is trying to find what parameters 
determine the period (time for a full swing) of a 
pendulum. After some experimentation, he con- 


cludes that the period T is given by T = 2n® where 
g = 9.8 m/s? is the acceleration of free-falling bodies 
near the surface of the earth and / is the length of 
the pendulum. (a) Show by the units of the terms 
that the student’s conclusion is incorrect. (b) Assum- 
ing that the period depends only on g and I, what 
is the proper functional dependence of T on these 
two quantities? 


CHAPTER 2 


Vectors 
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2.1 INTRODUCTION 


We will be dealing with two types of quantities in this book. Some quantities 
are fully specified only by a number and a unit, such as a quart of milk or a 
pound of potatoes. Such a quantity consisting only of magnitude is called a 
scalar quantity. Other measurements have meaning only if direction is spec- 
ified along with the magnitude. For example, telling a stranger that a gas 
station is 1 mi away will not help him unless you specify the direction also. 
A quantity that has both magnitude and direction and obeys certain algebraic 
laws is called a vector quantity and will be indicated in this book by boldface 


type. 


2.2 VECTOR COMPONENTS 


A vector direction must be specified in relation to a given coordinate system. 
Given a coordinate system, any vector can be expressed in terms of its com- 
ponents. 

Let us examine the concept of vector components first by simply using 
the compass points north, south, east, and west as the directions. 

Suppose you walk 5.0 mi east and then 4.0 mi north. How far are you 
and in what direction from the starting point? 

We draw Fig. 2-1. We see that we have right-angle geometry and, because 
R is the hypotenuse, using the pythagorean theorem, we have 


R = V(4.0 mi)? + (5.0 mi)* = 6.4 mi 
We also know from trigonometry that 


eno epposiie is _ 4.0 pt _ 0.8 
adjacent side 5.0 mf 


or 
@ = arctan 0.8 = 39° 


The distance R together with its orientation 0 is called the vector sum of the 
two vectors 5.0 mi east and 4.0 mi north and is given the name resultant. The 
5.0 mi east vector and the 4.0 mile north vector are called the components 
of R. 

Suppose, instead, you walk 5.0 mi east and 4.0 mi northeast, namely, 
45° north of east as in Fig. 2-2. What is the resultant? 

This is a little more complicated and, although you could use the law of 
cosines and the law of sines to solve the problem, there is a simpler way that 
will be used throughout the book. This method is particularly useful when 
you deal with problems that involve more than two vectors. Let us reduce 
the problem to two questions. 


1. How far are you to the east of the starting point? 
2. How far are you north of the starting point? 


5.0 mi 


FIGURE 2-1 


5 mi 


FIGURE 2-2 


4.0 mi 
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Note that the additional dashed lines in Fig. 2-2 labeled E, and N> represent 
the respective distances east and north traveled in the second leg. In this 
second part of the walk we see again a right triangle with E, and N> as the 
legs and with 4.0 mi as the hypotenuse. Recall from trigonometry that sin 
45° = N,/4.0 mi and cos 45° = E,/4.0 mi. Therefore, No = (4.0 mi) sin 45° 
and E, = (4.0 mi) cos 45°. Now make a table of the data 


FIGURE 2-3 
East North 
Walk 1 5 mi 0 mi 
Walk 2 (4.0 mi) cos 45° = 2.8 mi (4.0 mi) sin 45° = 2.8 mi 
Total 7.8 mi 2.8 mi 


Take a fresh piece of graph paper and plot these distances from a starting 
point, which will be at the origin of a compass coordinate system, as in Fig. 
2-3. The point marked x is your location from the starting point, R is the 
distance, and @ is the angle. We now have right-triangle geometry again and 
may write as before 


*mi 


(7.8 mi)? + (2.8 mi)? — §3mi ABN 


R= 
2.8 =) 
= t = 19.7° 
6 arctan eal 


Consider the more complicated walk of Fig. 2-4. What is the resultant K 
of the four displacements shown? Of 
To find R and 6, make a table of east-west and north-south displacements. payne 9.4 
Note here that the table will be in terms of east and north, so that a displace- 
ment to the west will be a negative east displacement and one to the south 
will be negative north. The components of the 4.0 mi, 5.0 mi, 7.0 mi, and 2.0 
mi displacements can be found by putting pieces of graph paper with the 
origins at the starting points of these legs and finding the components by 
right-triangle geometry, as in Fig. 2-9. 
Construct a table as in the previous example. 


FIGURE 2-5 
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East North 
Walk 1 (4.0 mi) cos 30° = 3.5 mi (4.0 mi) sin 30° = —=—_ 2.0 mi 
Walk 2 (5.0 mi) cos 60° = 2.5 mi (5.0 mi) sin 60° = = 4.3 mi 
Walk 3 —7.0 mi = —7.0 mi Omi= O mi 
Walk 4 —(2.0 mi) cos 45° = —1.4mi  —(2.0 mi) sin 45° = —1.4 mi 
Total —2.4 mi 4.9 mi 


Now take a piece of graph paper and plot the total east and north displace- 
ments as in Fig. 2-6. 


R = V(—2.4 mi)? + (4.9 mi)? = 5.5 mi 
4.9 pa 
6 = arctan Pea iL = 63.9° north of the west direction 
| —2.4 pad 


What we have done in Fig. 2-6 is to define an angle 6@ as less than 90°. 
This makes both the calculation and the spatial location much simpler. To 
work with 6 < 90° we had to ignore the sign of the coordinate and locate the 
resulting angle on the graph. Had we kept the sign of the coordinate in the 
calculation of the angle, it would not have helped much because the tangent 
is negative in two of the quadrants. We would still have to rely on some 
construction to locate the angle. The method shown in the preceding example, 
which first uses a graph to show where you are, leaves no question as to the 
meaning of the angle 6. How does one specify the angle? It is equally correct 
to say it is 64° north of west or, using the 360° scale with east as 0°, the angle 
would be 180° — 64° = 116°. One other point should be noted. In Fig. 2-5 
we symbolically used four pieces of graph paper to obtain the components 
of the vectors. We could have equally used a single piece by putting the 
beginning of each of the walks at the origin of the graph paper as in Fig. 
2-7; henceforth, we will conserve paper by this method. 

Now that we have related coordinate systems to navigation, we can apply 
the same techniques to cartesian coordinates x-y instead of compass directions. 
To show how the vector component method is used in more complicated 


FIGURE 2-7 


4.9m 


E (west) 


FIGURE 2-6 


situations, we will abandon our walks and consider forces. The concept of 
forces will be developed more fully in Chapter 4. For now, we can simply 
rely on our experience that a force is that which when exerted in some di- 
rection against an object may or may not cause it to move. It is a vector 
quantity. 


Example 2-1 


A box is pulled by two persons exerting the forces F; and F, shown in Fig. 
2-8, where F, is given as 50 lb. Two questions may now be asked. 1. What 
force F, must be applied so that the box moves only in the x direction? 2. What 
single force could replace F; and F, so that the box moves only in the x 
direction? 


Solution We obtain answers to these two questions by first constructing 
a vector diagram of the forces, as in Fig. 2-9, and tabulating the components 
of the forces. 


Force x components y components 
F, (50 Ib) cos 30° = 43.3 Ib — (50 Ib) sin 30° = —25.0 lb 
F, F, cos 37° = 0.8 F, F, sin 37° = 0.6 F, 
Total 43.3 Ib + 0.8 F, —25.0 lb + 0.6 F, 


If the object is going to move in the x direction, the resultant force must 
be in the x direction only, with no component in the y direction. If there is 
to be no net force in the y direction that could cause the box to move in that 
direction, then the sum of the positive and negative y forces must be zero. 
We can express this as 


ee 
—25 lb + 0.6 Fz = 0 


| 
o 


OT 


Question 2 can be answered from the sum of forces in the x direction. 
_2 F, = 43.3 lb + 0.8 Fz 

43.3 Ib + 0.8 X 41.7 Ib 

76.7 |b 


Therefore, we conclude that instead of the two forces F; and F, acting on the 
box, the same result can be obtained by a single force of 76.7 lb pulling in 
the x direction. 
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FIGURE 2-8 
Example 2-1. 


FIGURE 2-9 
Example 2.1. 
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We can now summarize the preceding discussion concerning the addition 


of vectors by the method of components. 


if 


Establish a coordinate system. Choose an x-y cartesian coordinate system 
that is convenient for calculation. For example, we will see in Chapter 4 
that when an object slides on an incline plane, it is convenient to choose 
the plane as the x axis. 


. Construct each vector with its tail at the origin of this coordinate system. 
. Drop construction lines from the head of each vector perpendicular to the 


x and y axes, and note from the laws of trigonometry that the x component 
of a vector is equal to the magnitude of the vector multiplied by the cosine 
of the angle that the vector makes with the x axis. Similarly, the y com- 
ponent of a vector equals the magnitude of the vector times the sine of 
the same angle. 


. Add algebraically the individual x components and y components of all 


the vectors to find the x and y components, respectively, of the resultant 
vector. 


. The square of the resultant equals the sum of the squares of the x and y 


components of the resultant. 


. To determine the angle of the resultant, it is best to make a sketch of the 


resultant, showing its x and y components, which will indicate in what 
quadrant the resultant is. The angle between R and the x axis (either 
positive or negative direction) is equal to the tangent of the absolute value 
of the y component of the resultant divided by the absolute value of the 
x component. 


We conclude this section by defining certain rules of vector algebra. 


. If we have a vector A, we define a vector — A as one whose magnitude is 


the same as that of A but its direction is opposite to that of A. 


. A vector 2A is one whose magnitude is twice that of A and whose direction 


is the same as that of A. More generally, when a vector is multiplied or 
divided by a scalar quantity, we obtain a vector of different magnitude but 
of the same direction as the initial vector. 


. When several vectors are added, they can be added in any order and thus 


the distributive law holds; for example, (A + B) + C= A+ (B+ OC). 


2.3 UNIT VECTORS 


It is lengthy to write and say x component, y component, and z component. 
A shorthand notation is used. Unit vectors i, j, and k are introduced. These 
have the respective directions x, y, and z and a magnitude of unity, so they —_ Unit vectors i, j, k on the three 


give direction without changing the magnitude. For example, the vector F, 


va 


FIGURE 2-10 


coordinate axes. 


gi “T 


in Fig. 2-9 that has components 43.3 lb in the x direction and — 25 lb in the 
y direction would simply be written as 


F, = (43.3i — 25 j)lb 


The conventional diagram for three dimensions is shown in Fig. 2-10 with 
the corresponding unit vectors. 

The vector F in Fig. 2-11 has the components shown, which are obtained 
by the right-triangle method of extending perpendiculars to the axes. This 
vector would be written as 


F = (4i+ 8j+5k)b 


and its magnitude is obtained from the three-dimensional pythagorean theo- 
rem | 


F = V(41b)2 + (8 lb»? + (5 Ib)? 
10.2 Ib 


When desired, the direction can be obtained by standard methods of analytic 
geometry to obtain its location in space. 


2.4 DOT PRODUCT 


Very often in physics we have two vectors with an angle @ between them, 
and we wish to find the product of their components that lie in the direction 
of one or the other vector. 

Consider Fig. 2-12. If we, for instance, select the A direction, then the 
component of vector B in that direction is given by dropping a perpendicular 
(Fig. 2-12 a) and noting from the resulting right triangle that the component 
of B in the A direction is B cos 6 and the product of this component and 
vector A is 


AB cos @ 


If, instead, we had selected the B direction we could equally have dropped 
a perpendicular from vector A to the line of vector B (Fig. 2-12b) and obtained 
the identical result. Because there is no specified direction for the resulting 
product, we define such a product as a scalar. We use the shorthand notation 
of a dot (-) to represent this type of product, which is referred to as the dot 
product ; 


A: B = AB cos 6 (2.1) 


where on the right side A and B are simply the magnitude of each of the 
vectors. We will use this dot product in Chapter 5 on work and energy, both 
of which arise from vector relationships although neither in itself has direc- 
tion. 
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FIGURE 2-11 
Components of vector F on the 
three coordinate axes. 


A+B = AB cos @ 


FIGURE 2-12 

Geometric representation of two 
ways of forming a dot product of 
vectors A and B. 
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Let us apply our definition of the dot product to the unit vectors i, j, 
and k. 
i-j = (1)(1) cos 90° = 


i-k = (1)(1) cos 90° = 
j:k = (1)() cos 90° = 
i-i = (1)(1) cosO® = 
j-j = (1)(1) cos 0° 
k-k = (1)(1) cos 0° 


— me me CD ODO & 


We see that when a unit vector is dotted with a different unit vector the result 
is zero, whereas when a unit vector is dotted with itself the result is unity. 


Example 2-2 
Find A: Bif A = 3i + 2j and B = —i + 3j. 


Solution 


> 
ow 
! 


(3i + 2j)-(-i + 3j) 

3i- (-i) + 3i- 3) + 2j-(—D + Qj 3; 
8) 4 0 40 46 

= 3 


You can verify that B- A gives the same answer; therefore the commu- 
tative law holds for the dot product of two vectors. 


2.5 CROSS PRODUCT 


In some topics of physics we often need to define a vector C, whose magnitude 
is equal to the magnitude of one vector A times the component of a second 
vector B in the direction perpendicular to A (see Fig. 2-13). Moreover, we 
want the direction of C to be perpendicular to A and B. 

We thus introduce a new type of product called the cross product of A 
and B. If C is the cross product of A and B, we write 


C=AXxB (2.2) 


By this definition, it is seen in Fig. 2-13 that the magnitude of the vector C 
is 


C = AB sin 0 


The direction of C is perpendicular to both A and B and consequently per- 
pendicular to the plane containing A and B. There are obviously two possible 
directions for a vector perpendicular to a plane. This ambiguity can be re- 


[A x B| = AB sin 0 


FIGURE 2-13 


Geometric representation of the 
cross product A x B = C, where 
C = AB sin 0. 


moved by using a right-hand rule. A simple mnenomic is the following. Con- 
sider the two vectors to be two sticks connected by a hinge at the apex of the 
angle. Mentally place the palm of your right hand against the outside of the 
first stick to be crossed (in our case A) as if to push the two sticks together 
(see Fig. 2-14). Do this with the thumb extended and the thumb will point 
in the direction of the vector cross product. The vector C’ = B X A will, 
from the definition, have the same magnitude as C. However, it is clear from 
Fig. 2-15 that the direction of C’ is opposite to that of C; namely, 


AXBz= -BXxXA 


Note that this is in contrast to the dot product where A: B = B- A. 
We can examine the resulting directions of cross products by operating 
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on the unit vectors of Fig. 2-10 with the right-hand rule. We find that FIGURE 2-14 
ixj =k Right-hand rule for the vector 
cross product. For A x B, curl 
jxk=i the fingers of the right hand in 
; ; a direction such that the fingers 
k xi =j seem to push vector A toward 
jxi = —-k vector B. The direction of the 
thumb points in the direction of 
kxj=-i the vector C = A X B. 
ixk= -j 


It should be noted in the definition thati xX i = j X j = k X k = 0 because 
the angle between a vector and itself is zero and sin 0° = 0. 


Example 2-3 
Find A x Bif A = 3i + 2j and B = —i+ 3j 


Solution 


A X B = (31 + 2j) X (—-i + 3j) 
= 3i x (-i) + 3i X 3j + 2j X (-1) + QB X 3j 
= 0+ 9k + 2k + 0 
= 11k 
PROBLEMS 


2.1 What are the x and y components of the fol- 
lowing vector displacements? (a) 2 m at 20°? (b) 3m 
at 120°? (c) 4 m at 240°? (d) 2.5 m at 325°? All angles 
are with respect to the positive x axis. 


FIGURE 2-15 
Right-hand rule for the vector 
cross product of B xX A = —C. 


2.2 A sailboat follows a series of racing buoys. On 
the first lap it goes 8 mi at 20°, then 10 mi at 40°, 
then 6 mi at 130°. What distance is it from its starting 
point, and in what direction must it sail to return? 

(Answer: 17.8 mi, 230.6°.) 
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2.3. A sailboat sails 7 mi in the direction 37° north 
of east, then 4 mi in the direction 53° west of north. 
What is the magnitude and the direction of the final 
leg that will bring it to the starting point? 


2.4 A vector displacement A in the x-y plane has 
an x component of 10 m. The angle between the y 
axis and the vector A is 37°. What is the magnitude 
of the vector A? 


2.5 A force of 100 Ib acts on an object at an angle 
of 20° with respect to the x axis, and a force of 300 
Ib acts at an angle of 60° with respect to the x axis. 
What single force must be applied at what angle to 
be the equivalent of these two forces? 

(Answer: 382 lb, 50.3°.) 


2.6 In problem 2.5 an additional 200 Ib acts at 215°. 
What single force at what angle will be the equiva- 
lent of these three forces? 


2.7 Consider the case discussed in Example 2-1, 
except that F; = F, = 50 Ib. The two men exerting 
these forces on the box ask a small boy to push on 
the box while they pull it so that it moves only in 
the x direction with a net force in that direction of 
90 Ib. With how large a force and in what direction 
does the boy push on the box? 

(Answer: 8.4 lb, —37°.) 


2.8 Express the vectors of problem 2.1 in i, j, k 
notation. 


2.9 Express the vectors of problem 2.2 in i, j, k 
notation and perform the summation in that nota- 
tion. 

(Answer: 11.31 + 13.8].) 


2.10 What is the magnitude and the angle of the 
resultant of vectors A, B and C, where A = 21 + 
3j, B = 4i — 2j, and C = —i + j? 


2.11 The sum of three vectors A, B, and C is equal 
to vector R. If A = 21 — 3j, B = —1 + 2j, and R = 
—2i + 3j, what are the components of vector C? 
Make a sketch of vector C on a cartesian system, 
find its magnitude and the angle it makes with the 
x axis. 


2.12 The resultant of vectors A, B, and C is 2i + 
j. lf A = 6i — 3j and B = 2i + 5j, find the com- 
ponents, the magnitude, and the angle of vector C. 


2.13 Vectors A and B have magnitudes of 3 m and 
4 m, respectively, and are 30° apart. Find A - B and 
A X B. 

(Answer: 10.4 m?, 6.0 m?.) 


2.14 Ifthe vectors A and B of problem 2.13 are 150° 
apart, find A- Band A xX B. 


2.15 Find the dot and cross products of vectors A 
and B of problem 2.13 if they are 0° apart. If they 
are 180° apart. 


2.16 Find the vector A x Bif A = i —3j + 2kand 
B = —2i — j + 3k. 


2.17 Find the dot product A- Bif A = 3i + 4j — 
k and B = —3j — 12k. What are the magnitudes of 
vector A and vector B? 


2.18 Use the dot product to find the angle between 
vectors A and B in problem 2.17. 


2.19 Find the angle between the vectors A = 4i + 
3j and B = 6i — 3}. 
(Answer: 63.4°.) 


2.20 What is the angle between the vector A = 
31 — 7j and the x-axis? 


2.21 Find the angles between the vector A = 2i — 
3j + 5k and the x, y, and z axes, respectively. 
(Answer: 71.1°, 119.1°, 35.8°.) 


2.22 Consider a vector A = 4i — 9j. Find a vector 

in the x-y plane that is perpendicular to A. 
(Answer: a (2.251 + j), where a is an arbitrary 
constant.) 


2.23 A-vector R = 9i — 12j can be expressed as a 
linear combination of two vectors A and B; namely, 
R = C, A + CB, where C, and C; are two scalar 
constants. If A = 5i — 3j and B = —i + 12j, what 
are C, and C,? | 

(Answer: C; = 1.68, C2 = —0.58.) 


2.24 The resultant R of two vectors A and B has 
half the magnitude of A and is perpendicular to B. 


If the magnitude of A is 5, what is the magnitude 
of B and the angle between A and B? 


2.25 Many theorems in geometry can be readily 
proved by vector algebra. Consider the triangle OAB 
in Fig. 2-16. Show that the line joining the midpoint 
of side OA to the midpoint of side AB is parallel to 
OB and its length is half that of OB. (Hint: Make 
vectors out of OA, OB, AB, CA, AD, and CD and 
find relations between these vectors) 


FIGURE 2-16 
Problem 2.25. 4 


2.26 Consider the line obtained by joining the or- 
igin and the point x = 5, y = 3 (see Fig. 2-17). Find 
the perpendicular distance h from a point P with 
coordinates x = 1, y = 7 to that line. 

(Answer: 5.49.) 
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2.27 The magnitude of vectors A and B are 4 and 
10, respectively. The magnitude of the resultant R 
is 12. What is the angle between A and B? 


FIGURE 2-17 
Problem 2.27. oO 


2.28 What is the area of the triangle formed by 
joining the following three points: x = 0m, y = 0 
mx=3m,y =4m;x =7m, y = 2m. Recall that 
the area of a triangle is Area = 1/2 base x height. 
(Hint: Make vectors out of two of the sides of the 
triangle and consider the magnitude of the cross 
product of those two vectors.) 

(Answer: 11 m2.) 
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3.1 INTRODUCTION 


In this chapter we introduce certain vector quantities—position, displace- 
ment, velocity and acceleration—used to describe the motion of a body. We 
define these quantities and discuss the mathematical relations between them. 
We then derive specific functional relations between them and time (a scalar 
quantity) for the case where the object moves in a straight line with constant 
acceleration. The chapter concludes with a discussion of projectile motion, 
one of the simplest types of two-dimensional motion. 


3.2 SPEED AND VELOCITY 


Two words in English, “speed” and “‘velocity,’”” are used interchangeably to 
indicate how fast a body is moving. In physics we make a distinction between 
them. The word “speed” is defined as a scalar quantity and “velocity” is a 
vector quantity. Thus, the average speed (where average will be represented 
by a bar on top of the quantity involved) is the distance traveled in any 
direction, As, divided by the time At, or 


—— As 
speed = re (3.1) 


where 


A(anything) = final value — initial value 


Velocity is defined differently. Consider a particle moving in space. Let 
the particle be at point P in Fig. 3-1 at some initial time fp and at point P’ 
some later time t;. The initial position of the particle can be specified by a 
position vector fy obtained by drawing an arrow from the origin of the coor- 
dinate system to point P. Similarly, the position at the later time is specified 
by a second position vector 1; that results when an arrow is drawn from the 
Origin to point P’. The position at any other point in the motion is specified 
by a corresponding position vector r. We can now define the displacement 
vector Ar as the vector difference between the final and the initial position 
vectors, namely, Ar = r — fp (see Fig. 3-1). Correspondingly, we define the 
average velocity v as the ratio of the displacement vector to the time taken for 
the displacement to occur, namely, 


%— 1% Ar 


v= = — 
tp — to At 


(3.2) 


The distinction between speed and velocity is difficult to grasp at first, but it 
is extremely important. Consider the walk taken in Fig. 2-1. Suppose it took 
1h. Then, by definition, Eq 3.1, the average speed of walking was As/At = 


FIGURE 3-1 
The displacement vector Ar is 
obtained by drawing an arrow 
from the initial position vector r, 
to the final position vector r;. 


(4.0 mi + 5.0 mi)/1 h = 9 mi/h, whereas the average velocity was Ar/At = 
6.4 mi/1 h = 6.4 mi/h in the direction 39° north of east. Consider an even 
more extreme example. Suppose a race car is traveling around a circular track 
of 1-mi diameter and its speedometer reads 100 mi/h. This is the speed. The 
time taken to reach any point is, from Eq. 3.1, At = As/speed. Because the 
track length is 7 xX diameter = 3.14 mi, the time to complete one circuit is 


_ 3.14 mi 
~ 100 mi/h 


At = 3.14 x 10°*h 

and the time to go halfway around is 1.57 x 10~* h. However, the car’s 
average velocity by the definition of Eq. 3.2 depends on its position. When 
the car has gone halfway around, the vector displacement from the starting 
point is the diameter or 1 mi. Hence, its average velocity to that point is 


_ 1 mi 
~ 1.57 x 10-2h 


V = 63.7 mi/h 
In one complete circuit, as the car passes the starting point its vector dis- 
placement is zero and hence 


0 mi 


Se ml 
314x102h  2™ 


Vv 
This seeming contradiction has occurred because we have taken large dis- 
placements for Ar. If we shrink the displacement to a minute amount by 
taking the limit 
Ar dr 


v= im —=-— 


(3.3) 
then the magnitude of the velocity, which is now called the instantaneous 
velocity, at any point on the track will equal the speed. This can be seen in 
Fig. 3-2, where we notice that as As becomes smaller, the difference between 
As and the corresponding Ar decreases. We should also note that in the limit 


y } ¥ 


FIGURE 3-2 
A curved path of a car traveling clockwise. As is the distance traveled by the car, and 
Ar is the displacement vector between the position of the car ry, at some instant and 
the position r, at the initial time. As the distance As becomes smaller, its value ap- 
proaches Ar. 
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where Ar becomes infinitesimally small, it becomes tangential to the path, 
and therefore the direction of the instantaneous velocity is the tangent to the 
path. Thus, while the magnitude of the instantaneous velocity remains equal 
to the speed, the direction part of the instantaneous velocity is changing. 
Velocity is a vector because it is equal to a vector displacement divided by 
time, which is scalar, and the division of a vector by a scalar does not remove 
the vector property. We should note that by definition, Eq. 3.3, the instan- 
taneous velocity v is the first derivative of the position vector with respect to 
time. It should also be pointed out that because Eq. 3.3 is a vector equation, 
it holds for each of the cartesian components of the vectors v and r, namely, 


_ dx _ dy _ dz 
a a and a 


Ox 
where v,, v,, and v, are the cartesian components of v and x, y, and z are 
those of r. 


3.3 ACCELERATION 


In the preceding section we introduced a convention in which A is a mea- 
surable change between a final value and an initial value and d is used for 
an infinitesimally small change. 

If there is a velocity change Av in a certain time At, we define the average 
acceleration as 


5. ov 
ae (3.4) 
or 
5 - Mow 
te — to 


where the subscripts f and o represent final and initial values, respectively. 
Usually in a problem we start our stopwatch at fp = 0, so the elapsed time 
is simply t; and we drop the subscript f. We may define an instantaneous 
acceleration as 


a= lim — = — (3.5) 
which is the first derivative of v with respect to time. Substituting Eq. 3.3 for 


v, we write 
Pee: 7 2 3.6 
— dt \dt) ~~ dP 8) 


which is the second derivative of r with respect to time. 


Example 3-1 


The position of a body on the x axis varies as a function of time according to 
the following equation 


x(meters) = (3t + 2t7)m 


Find its velocity and acceleration when t = 3 sec. 


Solution Because the body moves in a straight line, r = x. From Eq. 3.3 
dx ad 
aS as Se t + 2) ee 
v a di (3 2t*) = (3 + 4t) m/sec 


The velocity of the body at t = 3 sec is therefore 
v(t = 3 sec) = 3 + 4 X 3 = 15 m/sec 
From Eq. 3.5, 


1-2 
dt 


Notice that a is a constant, and therefore a(t = 3 sec) = 4 m/sec’. 


d 
= — + = 2 
Tt (3 + 4t) = 4 m/sec 


3.4 LINEAR MOTION 


Because displacement, velocity, and acceleration are vectors, we may treat 
them by the method of cartesian components introduced in Chapter 2. First, 
let us consider motion only in the direction of a single component, for ex- 
ample, the x direction, that is, motion in a straight line. 

If we start timing an object moving in the x direction when it starts from 
or is passing the x = 0 point, we may write Eq. 3.2 as 


= _x- 0 
* t-0 
or 
x = Ot (3.7) 


Because in this section we will be talking about motion in one direction, we 
will drop the subscript x from the velocity. 

Equation 3.7 results from the definition of average velocity; thus it holds 
in all cases whether or not the acceleration is constant. In the remainder of 
this chapter, we will consider only constant acceleration. 

The derivative of a variable, for example, the velocity v, with respect to 
a second variable, for example, time t, represents the instantaneous rate of 
change of the first variable (v) with respect to the second (f). Thus, the accel- 
eration as defined in Eq. 3.5 is the rate of change of the velocity with time. 
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If the acceleration is constant, the change in the velocity during the first, 
second, third, and all succeeding seconds of the motion will be the same and 
equal to the acceleration a. Thus, if the motion lasts t seconds, the change in 
the velocity Av = v — wv = at, where v is the final velocity and vp is the 
initial velocity. We can rewrite this result as 


V = Uo + at (3.8) 


If velocity v is plotted against time ft, it is seen that Eq. 3.8 is a straight 
line, as indicated in Fig. 3-3. The slope of this line is the constant accelera- 
tion a. FIGURE 3-3 

With a little bit of thought, we can obtain another important relation. 
When the velocity increases at a constant rate as in Eq. 3.8 and Fig. 3-3, the 


Plot of v versus t for constant ac- 


ae —- celeration. 

average velocity is one half the sum of the initial velocity vp and the final 

velocity v, namely, 
—_ 0 es Vo ae i) + Up 
v= > (3.9) x = —— f 

and Eq. 3.7 becomes 

0 + 0 —— i T Un 

x= aera (3.10) i= aa 


The three equations (3.7), (3.8), and (3.9) define linear motion for constant 
acceleration. Often, however, at least two of these, and sometimes all three, 
must be used to solve a problem. It is convenient, therefore, to combine these 
three equations to obtain two auxiliary ones and have all five available (thereby 
avoiding the necessity of solving simultaneous equations). We obtain the 
auxiliary equations in the following way. 
From Eq. 3.8 write 
UV — Uo 


— 
a 


and substituting it into Eq. 3.10 we obtain 


_ (v + Uo) (UV — 2%) 
7 Z a 


and 
v= — vg = 2ax (3.11) 
If we substitute Eq. 3.8 for v in Eq. 3.10, we obtain 


_ Uo + at + U 
2 


and 
1 

k= Ut+ sab (3.12) 

We may derive these equations more formally by integration. By defi- 
nition 

_ dv 


at 


(3.5) 
Rearranging terms and integrating, we write 


v t 
| dv = | a dt 
vo 0 


This equation holds in general whether or not acceleration is a constant. In 
the present case, acceleration is taken as constant, so a can be taken out of 


the integral and we write 
v t 
i du =a J dt 
vo 0 


VU — Uo = at 


This integrates to 


and 
V = Uo + at (3.8) 
From the definition 
dx 
v= at (3.3) 


Substitute Eq. 3.8 for v — 
| x t t t 
[ax Jo + at)dt = % [dt +a edt 
Xo 0 0 0 


| 
X — Xp = Uot + 5 at’ (3.12) 


Note that in this formulation of Eq. 3.12 we have not required that x = 0 at 
ft = 0 as in the previous algebraic derivations. 
We may use the chain rule to write 


Qa=— = ——_— = V— (3.13) 
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v — Up + at 


Ay ae 
= Xo = Vol hs 7 at 


— 
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Or 


[ovaaalia 


v2 2 
7 = a(x — Xo) 
v? — vg = 2a(x — Xo) (3.11) 


Equations 3.7 through 3.12 have been derived for motion in the x direction. 
Similar equations can simply be written for motion in the y and z directions 
when the components of the acceleration in these directions are also constant. 

There is one important thing to be noted here. In the solution of motion 
problems we must assign vector directions. Suppose we observe a boy throw- 
ing a ball as we look through a transparent piece of graph paper and draw 
lines of motion and displacement. We could lie on our side or stand on our 
head and draw the lines without having any effect on the boy and his ball. 
Therefore, choosing a particular coordinate system is a matter of personal 
convenience. The upward direction could be the positive y direction or the 
negative y direction, or even a direction at an angle on the graph paper, 
although we will always try to choose a system that will minimize the cal- 
culational steps. It is important to note that once we choose a coordinate 
system, all parameters have their vector direction controlled by it. If we choose 
the positive y direction as up and the boy throws the ball straight up, then 
the vector displacement from the ground to its highest position is positive. 
During its upward travel, because velocity is the displacement divided by the 
scalar time, it too is positive. The only motion is in the y direction, so we 
therefore use y, v,, and a, in the equations previously derived. Thus, Eq. 
3.8 is 


Vy = Voy + yt (3.8') 


We observe that in throwing the ball upward the largest value for the 
magnitude of the y velocity occurs as it leaves the boy’s hand; then the ball 
begins to slow down until the upward velocity is zero. The only way that 
this can occur is if a, in Eq. 3.8’ is negative. Now a, is the acceleration caused 
by the force of gravity acting on the ball, and we will see in the next chapter 
that because the force of gravity is downward then the corresponding accel- 
eration must also be downward. The acceleration caused by gravity is usually 
written as the symbol g and has the approximate sea-level value g = 9.8 m/sec’. 

When solving problems, the best approach is to tabulate what is known 
and what is to be found and select the appropriate equation. 


Example 3-2 


A boy throws a ball upward with an initial velocity of 12 m/sec. How high 
does it go? 


wv — uv = 2a(x — X) 


Multiflash photograph of a fall- 
ing ball. Note the increase in the 
distance traveled by the ball be- 
tween flashes as it falls. This re- 
flects an increase in the velocity 
of the ball caused by the down- 
ward-directed gravitational ac- 
celeration. 
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Solution We choose the starting point as the origin and the upward 
direction as positive. Because velocity is a vector displacement divided by 
time, upward velocity is also positive. The force of gravity is in the negative 
y direction, so the sign of the acceleration is therefore negative. First list what 
is known and what is to be found 


I 


Voy = 12 m/sec, v, = 0 (at its highest point), a, = g = —9.8 m/sec? 


y=? 


We select the y equivalent of Eq. 3.11 because all the quantities in that equation 
are known except y, the quantity that we want to find 


Uy — Udy = ayy 
Solving for y, we write 

_ Vy — Voy 
4 2 Ay 

Substituting the numerical values for the quantities in the equation, 


_ 0 — (12 misec)’? 
49 (—9.8 m/sec’) 


7.3m 


Example 3-3 


A boy throws a ball upward with an initial velocity of 12 m/sec and catches 
it when it returns. How long was it in the air? 


Solution As in the previous example, we choose the starting point as 
the origin and the upward direction as positive. 


Voy = 12 m/sec, a, = —9.8 m/sec”, y = 0 (vector displacement is zero 
because it returns to his hand), t = ? 


Select Eq. 3.12 
1 2 
Y = Voyt + 5 Ayt 
Using the fact that y = 0, Eq. 3.12 becomes 


1 
0 = Voyt = ayt? 


2 
We see immediately that if we divide both sides of the equation by t, we 
obtain 
1 
O = voy + = ayt 


2 
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and 


2 Voy 


— 


ay 


2 X 12 m/sec 
— 9.8 m/sec? 


2.45 sec 


In this example, the ball returned to its starting point, so the displacement 
y was zero. This simplified Eq. 3.12, because dividing both sides by t linearized 
the equation. If the ball had landed on a roof, then the left side of Eq. 3.12 
would not be zero and the equation to be solved would be quadratic. 


3.5 PROJECTILE MOTION 


We have treated motion in one dimension in the preceding section. Suppose 
we have a smooth, frictionless wall in the x-y plane. If we set an object in 
motion along the wall, we find experimentally that the object is accelerated 
downward in the y-vector direction but that there is no acceleration in the 
x-vector (horizontal) direction. That is, the object moves in the x direction 
with its constant initial x velocity but its y velocity is increasing downward 
owing to the acceleration of gravity. If we now perform the same experiment 
with an imaginary wall, we have what is called projectile motion. The char- 
acteristic in the coordinate system in which we are working is that the x and 
y motions and velocities are at right angles to each other and that there is an 
acceleration only in the y direction, a,; there is no acceleration in the x direc- 
tion. The equations of motion in the x and y directions are therefore 


Xx = Voxt 


i 2 
Y = Uoyt + 5 Ayt 


In view of the foregoing, projectile problems are treated as two separate 
linear motion problems, one in the x direction and another in the y direction, 
with only time as the common element. 


Example 3-4 
A ball moving at 2 m/sec rolls off of a 1-m-high table, Fig. 3-4. How far 
horizontally from the edge of the table does it land? 


Solution The ball will continue moving in the x direction for as long as 
it is in the air. We can use Eq. 3.7 to determine the x coordinate of the ball 
as it lands 


xf = xt 


Multiflash photograph of two 
falling balls: One released from 
rest and the other launched with 
an initial horizontal velocity. The 
vertical position of the two balls 
at any given time (time of the 
flashes) is the same for both, in- 
dicating that the vertical motion 
is unaffected by the initial hori- 
zontal velocity given to the sec- 
ond ball. 


Cox = ¢ isec 


FIGURE 3-4 


Example 3-4. The edge of the ta- 
ble is chosen as the origin of the 
x—y coordinate system. 


where ft; is the time that the ball is in the air. Note here that, because the 
velocity in the x direction is unchanged, 0, = vo, and therefore 


X, = 2 m/sec ff 
t; is the time when the y coordinate of the ball becomes —1 m. We have 


a, = —9.8 m/sec’, t, = ? 


y = —1m, y 


Using Eq. 3.12, 


Voy = 0, 


a 1 2 
Y = Voyt + 5 yt 


Because Up, = 0, this equation becomes 


1 


y = 5 Aye 
and 
poe 
ay 
_— =e 
ff ~ V— 9.8 m/sec? 
= + 0.45 sec 


and because in deriving Eqs. 3.9 through 3.12 we chose ¢t = 0 as the initial 
time, only the positive root is acceptable, therefore, 


xX, = 2 m/sec X 0.45 sec 
= 0.9m 


Let us examine a specific case of projectile motion along level ground. 
We will find the general formula for the distance that a person can throw a 
ball or that a gun can fire a projectile. The variables are shown in Fig. 3-5a 
and the initial velocity components in Fig. 3-5b. The distance x that the pro- 
jectile travels just before it strikes the ground is 
Xp = Vyty = Voxts (because Vv, = Vox) 

From Fig. 3-5b 

Vox = Vo COS O 
and therefore 

Xf = Uo cos 6 ty 
We determine the time in the air from the y-direction problem when the 


projectile is thrown upward with an initial velocity of vp, = Uo sin 6 and is 
acted on by the acceleration of gravity, a, = g = —9.8 m/sec’. At the end of 
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Cannonball trajectories for var- 


ious launching angles as con- 
ceived by Diego Ufano in 1621. 


(b) 

FIGURE 3-5 

(a) Projectile motion on level 
ground with v, and 6 the initial 
velocity and angle, respectively. 
(b) The x and y components of 
the initial velocity vo. 
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its trajectory the vector displacement is y = 0, So we may write 
Vii = 0, Voy = Vo sin 0, ay = —9.8 m/sec?, Ey = ? 


using 


1 2 
Y = Vot + > a,t 


2 (3.14) 
1 
0 = usin 6 t — 5 Sif 
Z 
tp = — Uo sin 0 
& 
Substitute this for ¢; in the equation for x-direction motion 
va 
xX, = <° 9 sin 6 cos 0 
Substitute the trigonometric relation 2 sin 6 cos 6 = sin 26 and obtain 
U6 
Xf = eo sin 20 (3.15) 


We can readily find from Eq. 3.15 the angle at which the projectile should 
be thrown (or fired) to achieve a maximum distance in the x direction for a 
fixed value of vp. The only variable is the angle in the term sin 26, and the 
sine has a maximum value of unity when the argument is 90°. Therefore, x; 
of Eq. 3.15 is maximum when 26 = 90° or 6 = 45”. 

Note that Eq. 3.15 is valid only when the projectile returns to the starting 
level, because we set y, = 0 in Eq. 3.14. If it returns to some other level, then 
the quadratic equation in time must be solved (see problems 3.16 and 3.17). 


Example 3-5 


A boy stands on the edge of a roof 10 m above the ground and throws a ball 
with a velocity of 15 m/sec at an angle of 37° above the horizontal. How far 
from the building does it land? See Fig. 3-6. 


Solution Let us choose the edge of the roof as the origin of the coordinate 
system. There is no acceleration in the x direction, so we may simply write 
the following for x distance 


xp = Vy — Voxt 


Up cos 37° = 15 m/sec X 0.8 = 12 m/sec 


Vox 


xX, = 12 msec ft, 


¥, = ld msec 


FIGURE 3-6 
Example 3-5. 
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We now need to find the time in the air, which is a motion problem in the 
y direction only. 


yy = —10m, vy, = 15 msec sin 37° = 15 m/sec X 0.6 = 9 m/sec 


I 


a —9.8 m/sec’, t, = ? 


y 
We use Eq. 3.12 
1 2 
Y = Uot + 5 4yt 
If t = t;when y = y, = —10 m, this equation can be written as 


1 
5 Ay HF + Voy ty — yy = 0 


Solving this quadratic equation for ft, 


1 
= Voy + | voy? — (4) (; “y) (-y) 


Substituting the numerical values for y, Voy, and a, 


—(9 m/sec) + V(9 m/sec)? + (2)(—9.8 m/sec’)(— 10 m) 


—9.8 m/sec? 


= 
t- = 2.6 sec, —0.78 sec 


Because we have started timing when the ball is thrown, the negative time 
solution is rejected because it has no physical meaning to this problem. Sub- 
stitute the positive time of 2.6 sec into the equation of motion in the x direction 
and obtain = 


x, = 12 m/sec X 2.6 sec = 31.2 m 


It will be instructive to find the magnitude of the velocity and the angle at 

which the ball strikes the ground. This is obtained from the components of 

the velocity just before it hits, as shown in Fig. 3-7. We see from the vector 

component method that the ball’s vector velocity just before striking the 

ground is given by the final value of its components, v7, and Uy. Because we — FIGURE 3-7 

have a right triangle, we may use the pythagorean theorem Example 3-5. Diagram to calcu- 
late the angle at which the pro- 

UF = Ue + Uy jectile strikes the ground. 
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and the angle 6 at which it strikes the ground is 


v 
8? = arctan -y 


Ofx 


Vp, = 12 m/sec because it is unchanged during the ball’s flight. We must 
therefore find v,;,. We obtain this from Eq. 3.8 in the y direction. 


Voy 
Vy = Voy + At, 


Vy = 9 m/sec — 9.8 m/sec? X 2.6 sec 


9 m/sec, t; = 2.6 sec, a, = —9.8 m/sec’, v%, = ? 


Uy = — 16.5 m/sec 
Then 
vy = V(12 m/sec)? + (—16.5 m/sec)* = 20.4 m/sec 
|—16.5 m/sec] 
= oO 4° 
? = arctan ioacca 5 


where @ is the angle indicated in Fig. 3-7. 


PROBLEMS 


3.1 A student drives to college 15 km away from 
home in half an hour. After classes, he returns home 
in 20 min. Find (a) the average speed on his way to 
college, (b) the average speed for the round trip, (c) 
his average velocity for the entire trip. 


3.2 The position of a particle moving along the x 
axis is given by x = 3 + 17t — 5t?, where x is in 
meters and t¢ is in seconds. (a) What is the position 
of the particle at t = 1, 2, and 3 sec? (b) At what 
time does the particle return to the origin? (c) What 
is the instantaneous velocity at tf = 1, 2, and 3 sec? 
(d) At what time is the instantaneous velocity of the 
particle zero? (e) What is the velocity of the particle 
as it passes through the origin? (f) What is the ac- 
celeration of the particle as it passes the origin? 


3.3 The position of a particle moving in a straight 
line is given by x = 5 + 2t + 4t* — BP. (a) Find an 
expression for the instantaneous velocity as a func- 
tion of time. (b) Find the position of the particle at 
t = 0, 1, 0.1, and 0.01 sec. (c) What is the average 
velocity between t = 0 sec and t = 1 sec, between 
t = 0 sec and t = 0.1 sec, and between t = O sec 
and t = 0.01 sec? (d) What is the instantaneous 
velocity at f = 0 sec? (e) What conclusion do you 
draw from the answers in (c) and (d)? 


3.4 Acar is driving east at 60 km/h, it then makes 
a turn and travels north at 50 km/h. If it takes 2 sec 
to make the turn, what is the average acceleration 
of the car as a result of the turn? 
(Answer: 10.85 m/sec’, directed 39.8° north of 
west.) 


3.11 


3.5 Consider the particle of problem 3.3. (a) Find 
an expression for the acceleration of the particle as 
a function of time. (b) What is the instantaneous 
velocity of the particle at t = 0, 1, 0.1, and 0.01 sec. 
(c) What is the average acceleration between t = 0 
sec and t = 1 sec, between t = 0 secandt = 0.1 
sec, between t = 0 sec and t = 0.01 sec? (d) What 
is the instantaneous acceleration at t = O sec. (e) 
What conclusion can you draw from the answers in 


(c) and (d)? 


3.6 A car starts from rest and accelerates uniformly 

to a speed of 25 m/sec in 8 sec. (a) What is the ac- 

celeration? (b) How far did it travel in the 8 sec? 
(Answer: (a) 3.13 m/sec’, (b) 100 m.) 


3.7 A rocket starting from rest rises to a height of 
20,000 m in 60 sec. (a) What was the average velocity 
of the rocket? (b) Assuming that the acceleration was 
constant, what was the acceleration of the rocket? 
(c) What was the velocity and the height of the rocket 
after 30 sec? 


3.8 A boy stands on the edge of a building 10 m 
above the ground and throws a ball upward with 
an initial velocity of 12 m/sec. It misses the roof on 
the way down and falls to the ground. Find how 
long the ball was in the air and its velocity just before 
it strikes the ground. (Hint: take y = Oatt = O and 
y final as —10 m). 

(Answer: 3.11 sec, — 18.44 m/sec.) 


5-9 A car moving at 25 m/sec strikes a tree, and the 
tree is seen to dent the front by 0.5 m. Assume that 
the deceleration of the car was constant. Find the 
deceleration and time it took the car to stop. 


3.10 A car moving with constant acceleration cov- 
ers a distance of 50 m between two points in 5 sec. 
Its velocity as it passes the second point is 16 m/sec. 
(a) What is its acceleration? (b) What was its velocity 
as it passed the first point? 

(Answer: (a) 2.4 m/sec’, (b) 4.0 m/sec.) 


A ball is dropped from the roof of a building. 

It is observed to take 0.2 sec to pass by a window 

2m high. How far is the top of the window from 
the roof? | 

(Answer: 4.15 m.) 
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A ball is dropped from a bridge 60 m above 
the surface of the water. One second later, a second 
ball is thrown down with an initial velocity vo. Both 
balls strike the water at the same time. (a) How long 
were the balls in the air? (b) What was the initial 
velocity of the second ball? (c) What were the ve- 
locities of the balls as they struck the water? 

(Answer: (a) 3.50 sec, 2.50 sec, (b) — 11.76 m/sec, 
(c) —34.30 m/sec, — 36.25 m/sec.) 


A motorcycle is waiting at an intersection. As 
the light turns green it starts with an acceleration of 
20 m/sec”. At that same moment a car, moving with 
constant velocity of 120 m/sec overtakes and passes 
the motorcycle. (a) How far from the traffic light will 
the motorcycle overtake the car? (b) What is the ve- 
locity of the motorcycle at that point? 


3.14 A girl drops a flowerpot from a window 50 m 
above the ground. At the same instant a boy directly 
under the flowerpot throws a stone with an upward 
velocity of 30 m/sec. (a) How far above the ground 
will the stone hit the pot? (b) How long after the 
flowerpot was dropped does the hit take place? (c) 
What is the minimum velocity with which the stone 
must be thrown for the hit to occur? 

(Answer: (a) 36.4 m, (b) 1.67 sec, 

(c) 15.65 m/sec.) 


3.15 An electron is set in motion horizontally with 
a velocity v, = 4 X 10° m/sec. How far will it fall 
while traveling a horizontal distance of 10 m? 


3.16 A boy standing on the ground throws a ball 
at an angle of 37° above the horizontal with a velocity 
of 15 m/sec. It lands on the edge of a flat roof of a 
building 3 m high. How far horizontally from the 
boy does it strike the roof? 

(Answer: 16.86 m.) 


3.17 A boy standing on the ground throws a ball 
at an angle of 37° above the horizontal with a velocity 
of 15 m/sec. It strikes the wall of a building 16 m 
away. How high above the ground is the point at 
which the ball strikes the building? 

(Answer: 3.32 m.) 


3.18 An artillery gunner wishes to have a projectile 
land at a point on level ground 20,000 m away from 
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the gun. If the muzzle velocity is 500 m/sec and the 
muzzle is assumed to be at ground level, at what 
angle above the horizontal should the gun be aimed? 


3.19 If the gun of problem 3.18 is on a hill 30 m 
high and the same angle of elevation is used, how 
far beyond the target will the projectile land? 


3.29 A batter at home plate hits a baseball 1 m 
above the ground. The ball leaves the bat in the 
direction of an outfielder with a velocity of 30 m/sec 
at an angle of 30° above the horizontal. Half a second 
after the ball is hit, the outfielder 100 m away from 
home plate runs to catch the ball. How fast must he 
run to catch the ball just before it hits the ground? 

(Answer: 7.16 m/sec.) 
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4.1 INTRODUCTION 


In this chapter we will consider Newton’s three laws of motion. Although 
when first propounded they were postulates, they have since been verified 
by experiment in so many ways that they are now considered Laws of Nature. 
A careful observer will note that they are not quite independent laws—that 
is, one implies another, although they are usually listed separately as if they 
were independent. There is one consistent word in these three laws and that 
is “body.” We sometimes speak of this as the newtonian body. Notice that 
body is singular. In a given physical situation we must first define the new- 
tonian body, which may often be a mathematical point. If the situation has 
two bodies, then Newton’s laws must be applied separately to each. Often 
we solve complicated systems of solids on a computer that can remember 
and vary 10,000 atoms in a solid. We require an equal number of applications 
of Newton’s laws, one for each, although the complexity of the solution of 
that number of simultaneous equations is often reducible by symmetry of 
behavior. 

In addition to using the basic terms of Chapter 1—length, mass, and 
time—we will discuss the term force, which we have used a bit freely. A 
precautionary word might be said of these. 

Length, at least on earth, can be measured by adopting a standard such 
as the length of the king’s foot, or the length of a standard bar of metal, and 
comparing other lengths with it. 

Time can be measured by divisions of the motion of the earth either in 
rotation about its own axis or in revolution about the sun. Again, a standard 
has been established by our environment. An early recorded question about 
what time really means is found in a discussion by St. Augustine in his 
Confessions around 400 A.D. “For so it is O Lord, my God, I measure it, but 
what it is I measure I do not know.” 

Mass is an even more obscure property. Newton first referred to it as 
inertial mass, that property of a body which resists being set in motion if a 
force is applied. But what is force? It is that entity which under certain con- 
ditions, to be discussed shortly, can change the state of motion of a mass 
when it acts on it. And thus we find ourselves in a circular argument. We 
may define mass if force is known, or we may define force if mass is known. 
The customary approach is to start with mass and define force through the 
motion it causes on mass. This enables us to use a combination of dimen- 
sions—length, mass, and time for force. So if we choose an arbitrary object, 
and agree that it be the standard of mass (the kilogram was selected), we will 
be able to evaluate the mass of any other body by means of Newton’s second 
law. We will shortly see how this is done. 


4.2 NEWTON’S LAWS 


We will not state the three laws exactly as Newton did; instead we will use 
modern English so that we may discuss them with no misunderstanding. 


Isaac Newton (1642-1727). 


First Law: Every body of matter continues in a state of rest or moves 
with constant velocity in a straight line unless compelled by a force to 
change that state. 


Second Law: When net unbalanced forces act on a body, they will produce 
a change in the momentum (this concept will be defined shortly) of that 
body proportional to the vector sum of the forces. The direction of the 
change in momentum is that of the line of action of the resultant force. 


Third Law: Forces, arising from the interaction of particles, act in such 
a way that the force exerted by one particle on the second is equal and 
opposite to the force exerted by the second on the first and both are 
directed along the line joining the two particles. (Or, as usually expressed, 
action and reaction are equal and opposite.) 


In the first law Newton implies what we call frames of reference. He has 
implied that a body at rest with respect to an observer can be analyzed in the 
same way as one moving past the observer at a constant velocity. That is, 
the same force applied to either body will change its state of motion in the 
same way. 

The third law of action and reaction implies that there can be no single 
force in isolation. A force must act on a body and, when it does, the body 
acts on the source of the force. Consider a bat striking a ball. The bat is a 
mass in motion with a certain velocity. If it strikes the ball, it exerts a force 
on it and changes the ball’s state of motion. At the instant of striking, the 
ball exerts a force on the bat in the opposite direction, thereby changing the 
bat’s state of motion by slowing it down. If the bat misses the ball, it swings 
through the air with no appreciable change in motion because it has not 
exerted force on any body of substantial mass and therefore nothing has 
changed its state of motion. Strictly speaking, the bat is striking air molecules 
during its motion, thereby changing their state of motion and slightly reducing 
its speed. There is another force acting to change the direction of the bat’s 
motion that causes it to move in an arc. If it were not for the force of the 
batter’s hands the bat’s motion would continue in a straight line. That is, if 
the batter lets go of the bat, it will fly off in a straight line; we will consider 
this phenomenon later. 

The first and third laws set the stage and the conditions for the second 
law. To appreciate Newton's approach, let us briefly look at it in historical 
perspective. It is common knowledge that Galileo (1564-1642) was tried for 
disobeying a directive from Catholic Church authorities not to state or publish 
his views of motion of the solar system. The next great natural philosopher 
to take a keen interest in the laws of mechanics was the Frenchman René 
Descartes (or in Latin, Renatus Cartesius) (1596-1650). Our cartesian coor- 
dinate system bears his Latin name. He decided from his studies that the 
most important property of a body in a mechanical system was what he called 
its momentum, the mass times the velocity, mv. We will deal with this in 
Chapter 6. Being a careful observer, he could not help but notice what had 
happened to Galileo, so he declined to publish his thoughts on mechanical 
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Galileo Galilei (1564-1642). 


René Descartes (1596-1650). 
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systems. These were embodied in his manuscript Le Monde, which was first 
published in Amsterdam in 1662, 12 yr after his death. The Reformation had 
by then reached Holland, so the fear of recrimination by ecclesiastical au- 
thorities was of little concern. 

Meanwhile, back in England, the Great Plague was raging and everyone 
who had a relative in the country escaped from the city of London. Among 
these was Newton, who, according to legend, was letting apples fall on his 
head. A copy of Le Monde was given to him, and he was able to make the 
creative step shortly thereafter, although his approach was somewhat indirect. 
The concept of momentum, mv, involves constant or zero velocity, which is 
embodied in his first law. If a force is applied against a body, the body resists 
with what he called an inertial force, namely, resistance to having its state of 
motion (momentum) changed. This is the principle embodied in the third 
law. If, however, a force is applied to a body for a given length of time, At, 
the momentum will be changed by Amv. He called the product of force and 
time an impulse, and he wrote the basic principle of the second law that the 
application of an impulse to a body caused a change in its momentum or 


FAt = Amv (4.1) 


He recognized, however, that direction was equally important. That is, if an 
impulse was applied in the x direction, the momentum of the body would 
be changed in only the x direction. Thus Newton introduced the requirement 
of vectors in calculations. 

If we divide Eq. 4.1 by At and consider the mass of the body to be 
constant, we may write 


F = m— (4.2) 


where we must recognize that in most physically realizable situations F is not 
a constant but rather an average force; for example, the force of a bat against 
a baseball. In Chapter 3 we defined a A as a measurable quantity. Newton 
realized that he wanted to have a form of Eq. 4.2 for very small, or instan- 
taneous, values of time. Because only tentative beginnings of calculus existed 
at that time, he proceeded to improve the methods. (G. Leibniz, a contem- 
porary German mathematician, also refined the calculus, independently.) 
Combining the result of Eq. 3.5, a = dv/dt, with Eq. 4.2, we may write 


F = ma (4.3) 


To use Newton’s great second law properly, we must include formally the 
two additional concepts: (1) This is a vector equation, and (2) the force, which 
is now instantaneous so the average is not required, is actually the net, or 
unbalanced, force in a given direction. We write this as the algebraic sum in 
each direction. 

The forms of Newton’s law that we will use are therefore 


2F, = ma,, 2XF, = ma, 2XF, = ma, (4.4) 


- Wha 


The concept of summation of forces can be understood from elementary 
examples. Consider a tug-of-war with equal numbers of people of equal strength 
pulling on a rope in opposite directions. If we consider a point in the center 
of the rope as the newtonian body, we conclude that the sum of forces acting 
on that body is zero and we will observe no acceleration. If a small child joins 
one of the sides and pulls, then the sum of forces is no longer zero, but 
instead there is a net force in the direction that the child pulls with magnitude 
equal to the force the child exerts. Hence, the rope will be accelerated in his 
direction. 

It is convenient at this point to introduce the word tension, which is used 
to convey the transmission of a force through a rope. In this example, none 
of the contestant’s hands are on the newtonian body at the center of the rope; 
yet, if the center is not being accelerated in either direction, the sum of forces 
at that point must be zero. If we insert a spring scale in the rope on either 
side of the center, we note that the same force is present, an example of action 
and reaction. Why does the spring scale not read zero, since we have just said 
that the sum of the force is zero? The answer lies with the third law of action 
and reaction. If you pull on one end of a spring, the other attached end is 
being pulled equally and in the opposite direction. Thus, although the sum 
of the forces is zero and the scale does not accelerate, the spring is nevertheless 
stretched, which causes the scale to read the value of whatever force is being 
applied to either end. We may perform the same measurement at any other 
point along the rope and the scale will read the same. The measured force 
in the rope at any point is called the tension in the rope. We could lower a 
curtain at the middle of the rope and tie one end of the rope to a wall and 
send that team home. The other team would not be aware of it, and a mea- 
surement in the rope would indicate the same tension. Suppose we went 
behind the curtain and cut the rope: How would Newton’s law apply? 


4.3 MASS 


Let us now reconsider our dilemma of defining mass and force. At this point 
we know from Chapter 3 how to measure acceleration, and we have stated 
that everyone has agreed to accept a certain block of material as having a 
mass of 1 kg. We do not yet know how to measure force, but we can devise 
a system to reproduce a given force, such as a pull on a rope with a spring 
scale to measure the same tension. If we exert this force on the standard 
kilogram, mo, with no-other forces such as friction or gravity to interfere, we 
can measure an acceleration ag. If we apply this same force to a different mass 
m,, we measure a different acceleration a;. From Eq. 4.3 we may write for 
each experiment 


F = Mp ao 


F = m, a, 
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tii 
The standard kilogram, a plati- 
num-iridium cylinder kept at the 
International Bureau of Weights 
and Measures in Sévre, France. 
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and, equating the two because the forces are equal, we have 


t= (4.5) 


a relation independent of the value of the force. We thus have a method of 
measuring the mass of any other body in relation to a standard mass. 
The unit of force can be defined in terms of mass, length, and time using 
Eq. 4.3. 
F 


ma 


on 


where brackets contain the dimensionality of the quantities involved and M, 
L, and T stand for dimensions of mass, length, and time, respectively. Because 
the equation must balance dimensionally, force has units of mass x length/time? 
or kilogram-meter per second? 
ML 
(re) 


In the SI system of units, this combination of units is called newton (N) for 
simplicity. A force of 1 N is that force which causes a mass of 1 kg to be accelerated 
at a rate of 1 m/sec? (or 2 kg accelerated at 0.5 m/sec’, and so on). 


4.4 WEIGHT 


A simple way to determine mass is to weigh it on a balance scale. In this 
method, a balance consists of a rod pivoted in the center so that the weighing 
pan on each side is equidistant from the center. (In Chapter 8 we will see 
how. a balance scale may be constructed with arms of unequal length.) The 
unknown mass is placed on one side, and multiples or fractions of a standard 
kilogram are placed on the other side until a balance is achieved. In this way 
the magnitude of the unknown mass can be determined because both the 
unknown and known masses are being acted on by the same force, that of 
gravity. 

The force of gravity can be expressed in terms of Eq. 4.3 by measuring 
g, the acceleration resulting from gravity. This can be done by noting that 
the rate of free fall of all objects in a vacuum (to eliminate air resistance) at 
a given point on earth is the same. The downward acceleration at sea level 
is approximately the same at all locations, or g = 9.8 m/sec’. So the force on 
an object of mass m resulting from gravity is, from Newton’s second law 


F = mg 
and in the English language we call this force the weight of an object or 


Weight = mg (4.6) 


Weight = mg 
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Thus, for 1 kg 


Weight = 1 (kg)g(m/sec’) = g(kg m/sec’) 


g newtons 


and 1 kg weighs 9.8 N. On the surface of the moon the acceleration of gravity 
is about one-sixth that of earth, so the weight of 1 kg will be one-sixth its 
weight on earth. In outer space at great distance from all other objects, the 
gravitational force will be near zero, and the kilogram will have almost zero 
weight. But its mass is unchanged. It takes the same force to produce a given 
acceleration in space as it does on the moon or on the earth. 

In the English system we use units of pounds to express the weight of an 
object. Therefore, the pound is a force. Acceleration is ft/sec? and mass has 
units of 


Weight(pound) ——) 
Or 
9(ft/sec*) ft 


The unit of mass in the English system is called the slug (from sluggish). At 
the surface of the earth the acceleration of gravity is 32.2 ft/sec? and 1 lb weight 
has a mass of 1/32.2 slugs. 


4.5 APPLICATIONS OF NEWTON’S LAWS 


4.5a Zero Acceleration 


It is seen from Eq. 4.4 that when a = 0 ina given direction the sum of forces, 
or net force, in that direction is zero. This fact can be used to gain information 
about the forces acting on an object when the object is not accelerated. When 
several forces act on an object but their effects cancel so that the object is not 
accelerated, the object is said to be in equilibrium. 


Example 4-1 


A block rests on a table. What are the forces acting on the block? See Fig. 
4-1. | 


Solution Yake the upward direction (+ y direction) as positive. We know 
that there is a force downward equal to the weight of the block. But because 
the block is not accelerating there must be an equal force upward, which we 
will call N for normal force, so that the sum of forces in the y direction is zero. 
Note here that the word normal is used in the mathematical sense of the 
direction perpendicular to a plane. We write Newton’s law as 


ZF, = 0 


FIGURE 4-1 


N = mg Example 4-1. 
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So the table exerts a force equal and opposite to the weight of the block; the 
table exerts a force on the floor equal to the sum of its weight and that of the 
block, the floor exerts an equal and opposite force on the table legs, and so 
forth. 


Example 4-2 


A child pulls a toy boat through the water at constant velocity by a string 
parallel to the surface of the water on which he exerts a force of 1 N. What 
is the force of resistance of the water to the motion of the boat? See Fig. 4-2. 


Solution Let F be the force parallel to the water of the string and f be 
the force of resistance of the water. Let us take the direction of F as the 
positive x direction. Because constant velocity means zero acceleration, 


SF, = 0 
F-f=0 
f=F=1N 


Example 4-3 


Two ropes attached to a ceiling at the angles shown in Fig. 4-3 support a 
block of weight 50 N. What are the tensions T; and T2 in the ropes? 


Solution Note here that the ropes exert forces both on the block and on 
the ceiling. The newtonian body of our concern is one through which all of 
the forces pass, namely the block. We therefore use the tensions acting on 
the block. We first draw a vector diagram of the forces (tensions) as in Fig. 
4-4. If we examine the newtonian body, we see that it is not accelerating in 
either the x or y directions. We may therefore write 


SF, = 0, XF, = 0 


By the component method of Chapter 2, we find the x and y components of 
the forces and substitute them into the equations. 


=F, = 0 

T, cos 37° — T>, cos 53° = 0 

0.8 T, — 0.6T, = 0 

xF, = 0 

' T, sin 37° + T> sin 53° — 50N = 0 
0.6 T,; + 0.8 T, — 50N = 0 


FIGURE 4-2 
Example 4-2. 


FIGURE 4-3 
Example 4-3. 


mg = 50 N 


FIGURE 4-4 
Example 4-3. 


APPLICATIONS OF NEWTON'S LAWS 


We solve these simultaneously by substituting either T; or T2 from one equa- 
tion into the other. For example, from the =F, = 0 equation we get 


0.6T- 3 
T; = ——_+ 


0.8 4 


2 
Substituting into the second equation 


1.25 T, = 50 N 


T> = 40N and T; = = T; 


I 
3 
Z 


4.5b Constant Acceleration 


In a constant acceleration situation we must examine the motion of the new- 
tonian body in all of the cartesian directions. It may be accelerating in some 
directions but not in others. The direction or directions in which it is not 
accelerating may give additional information about the forces acting on the 
body. 


Example 4-4 


A child pulls on a string attached to a 1-kg toy boat at an angle of 45° with 
a constant force of 2 N (Fig. 4-5). The boat goes from rest to a velocity of 0.2 
m/sec in 0.5 sec. Assuming constant acceleration, what is the force of resist- 
ance of the water? 


Solution 
=F, = ma, 
From Fig. 4-5, this becomes 
2 N cos 45° — f = ma, 
(2 N)(0.71) — f = (1 kg)ax 


where we used the component of force in the direction of motion and f is the 
force of resistance of the water. We find a, by the method of Chapter 3. 


Vox = 0, Vg = 0.2 m/sec, tf = 0.5sec, a, = ? F = — 
f ie 
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— Up, 0.2 m/ — 0 m/sec au 
i Oe FIGURE 4-5 
tf 0.5 sec Example 4-4. 
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Substituting this result for a, gives 
1.42N — f = 0.40N 
f = 1.02 N 


Example 4-5 


A block of mass 8 kg is released from rest on a frictionless incline that is at 
an angle of 37° with the horizontal (Fig. 4-6a). What is its acceleration down 
the incline? 


Solution In this situation it is convenient to tilt our graph paper so that 
the x axis is along the incline, for that is the direction in which the acceleration 
is to be determined. The y axis will be perpendicular to the incline. The vector 
force diagram is shown in Fig. 4-6b. The only forces exerted on the block are 
mg downward and the normal force N on the block exerted by the plane that, 
as we indicated in Example 4-1, is perpendicular to the surface. The com- 
ponent of mg along the x axis, F,, is determined by dropping a perpendicular 
from the end of the mg-force vector to the x axis. The angle between mg and 
this perpendicular is 9 = 37° by the geometric rule that two angles are equal 
if their sides are mutually perpendicular: A is perpendicular to D, and B is 
perpendicular to C. Therefore, 


io 
sin 37° = — 


mg 
F, = mg sin 37° 


From Newton’s second law, Eq. 4.4, 


F, = ma, 
Fy 
a, = 
m 
mg sin 37° 
m 


= g sin 37° = 9.8 m/sec* X 0.6 


5.9 m/sec? 


Two important points can be seen in this simple problem: 


1. Because the acceleration is independent of the mass, all masses starting 
from rest at the same height on the same plane will have the same accel- 
eration and, therefore, reach the bottom at the same time. 


2. The acceleration is less than the acceleration of gravity because only a 
component of the force of gravity on the body is directed down the plane. 


FIGURE 4-6 
Example 4-5. (a) Diagram of the 
problem. (b) Force diagram. 
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Example 4-6 © 
Masses of 2 kg and 4 kg connected by a cord are suspended over a frictionless E sa 
pulley (Fig. 4-7a). What is their acceleration when released? 


Solution Before solving this problem, we note three important facts. 
First, because the pulley is frictionless, the tension in the rope is the same 
on both sides. If it were not, the cord would slide over the pulley until the (a) 
tensions were the same. Second, the tensions are not the same as in a static 
situation; that is, we cannot equate T = mg because =F, # 0. Third, there are 
two newtonian bodies and we must write an equation for each, but note that 
while m; moves upward with a positive acceleration, m2 moves with an ac- 
celeration having the same magnitude but directed downward. The force 
diagrams for the two bodies are given in Fig. 4-7b. 

For body m, we write 


2F, = mya ; 
T —- mg = ma FIGURE 4-7 
Example 4-6. (a) Diagram of the 
= m(g + a) roblem. (b) Force diagram for 
P & 
For body m, each of the masses. 
2F, = moa 


and, noting that because upward motion was chosen as positive for body 1, 
the downward acceleration of body 2 must be negative, we write 


T — mg = m2(—4) 
T = mg — 4a) 


Substituting the T from the body 1 equation, into the equation for body 2, as 
the tensions are the same, we obtain 


m(g + a) = mg — 4) 
Rearranging terms, 


a(m, + M2) = g (m2 — mM) 


ae Mz — ™ 
~ 8 Mm, + My 
4kg —2k 
= 9.8 m/sec* X te = 3.3 m/sec? for body 1 
and 
a = —3.3 m/sec? for body 2 
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4.6 FRICTION 


We have to exert a steady force to drag an object at constant velocity across 
the floor. Because the velocity is constant, the acceleration is zero and the 
sum of forces on the object is correspondingly zero. This means that there is 
a force equal and opposite to the force that we exert that resists the motion 
of the object. However, our tug-of-war example does not apply here, for if 
we stop pulling the object the resistive force does not start to pull it in the 
opposite direction. Nearly all surfaces have a certain amount of roughness, 
visible under a microscope, and it is the breakage of these rough protrusions 
or the rising over them that causes the resistance to motion. This resistive 
force is called the force of friction. The earlier example, 4-2, of the boy with 
the boat is another type of friction, that of water resisting the motion of an 
object moving through it. But why consider friction at all in a book about 
semiconductors and their circuits? Because we commonly experience friction 
of the types we are discussing here and thus they are easier to comprehend. 
We will later consider the motion of electrons through a solid under the 
influence of electric forces. The motion of the electrons is impeded by their 
banging into the atoms in the solid and losing energy in the process. This is 
another type of friction. 

Returning now to the behavior of an object with an opposing frictional 
force, we must leave First Principles temporarily and rely on experimental 
data. There are two types of friction, static and kinetic. The starting friction is 
called static. The friction of motion is called kinetic. We observe from experience 
that it is harder to start an object moving across a floor than it is to maintain 
its motion; static friction is larger than kinetic friction. We will only consider 
kinetic friction. If we wish to measure the force of kinetic friction, we have 
only to measure the force required to keep an object in motion at constant 
velocity on a level surface. If we add a weight equal to that of the object on 
top of it, we find it takes twice the force to keep it moving at constant velocity; 
with the object weighing three times as much, then three times the force is 
required. We would correctly conclude that the force of friction is proportional 
to the weight of the object. But, as can be seen from Fig. 4-8, it is equivalent 
to say that the force of friction is proportional to the normal force because 
mg = N. Either way we say that the force of friction is proportional to the 
force pushing the surfaces together. Therefore 


f oa N (4.7) 


Now we know that it is easier to pull or push an object across ice than across 
a floor. Therefore, we may transform the proportionality of Eq. 4.7 to an 
equality by introducing a constant that characterizes the surface. We custom- 
arily use the Greek letter (mu) for this, and yp is called the coefficient of friction. 
Thus 


f = »N (4.8) 


mg 
FIGURE 4-8 
Forces on a mass that give rise 
to a force of friction f. 


Example 4-7 


A force of 10 N is required to keep a box of mass 20 kg moving at a constant 
velocity across a level floor (Fig. 4-9). What is the coefficient of friction? 


Solution Because the velocity is constant, a, = 0 and a, = 0, and 


2F, = 0 


and 


But 


or 


10 N 
20 kg x 9.8 m/sec? 


pw = 0.05 


Suppose that the surface is not level but is inclined by an angle 6 with 
respect to the horizontal, as in Fig. 4-10. We see that the component of the 
weight mg along the axis perpendicular to the plane is —mg cos 0 and, by 
Newton’s second law, because a, = 0, the normal force N exerted by the 
plane on the block is N = mg cos 6. Thus, Eq. 4.8 can be written in a more 
general form as 


f = pmg cos 0 (4.9) 


when @ = 0, f = umg and is at a maximum. When 6 = 90°, the incline is 
standing vertically and there is no force pushing the surfaces together and 


f=. 


Example 4-8 
A block is placed on a plane inclined to the horizontal at 37°. The coefficient 


of friction between the plane and the block is 4 = 0.4. When the block is 
released, what is its acceleration down the plane? 
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FIGURE 4-9 
Example 4-7. 


f = wmg cos @ 
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Solution The forces along the plane are the force of friction f upward 
and the component of the force of gravity Fp downward (see Fig. 4-10). Choose 
the downward direction as positive and write Newton’s second law. 


Ze plane ~ MaApiane 
Fp ~ f a MA€piane 
We have seen in Example 4-5 that 
Fp = mg sin 0 


and Eq. 4.9 gives the expression for f 


mg sin 86 — wmg cos 8 = MApiane 


Solving for the acceleration, we obtain 


Ap — 
plane ni 


= g sin 6 — wg cos 0 


Substituting the known quantities 


mg sin 0 — wmg cos 0 


FIGURE 4-10 
A block in an inclined plane. The 


normal force is equal to the com- 
ponent of the weight perpendic- 
ular to the plane mg cos 6 and 
the force of friction is equal to 
—pmg cos 0. 


Aplane = 9.8 m/sec? X 0.6 — 0.4 X 9.8 m/sec? x 0.8 


2.74 m/sec? 


We see that the mass cancels; that is, all blocks with the same coefficient of 


friction will have the same acceleration down the plane. 


PROBLEMS 


4.1 A 50-N weight is suspended by a rope from 
the ceiling. A horizontal force pulls it sideways, 
causing the rope to make an angle with the ceiling 
of 53°. When the weight is in equilibrium, what is 
the force? 


4.2 A 40-N weight is suspended by a rope from 
the ceiling. Another rope pulls horizontally on it 
sideways so that the suspending rope makes an an- 
gle of 60° with the ceiling.. What are the tensions in 
the ropes? 


4.3 A 50-N weight is suspended by a rope from 
the ceiling. A horizontal force of 40 N pulls on the 
weight in the x direction. (a) What is the angle that 
the rope makes with the ceiling? (b) What is the 
tension on the rope? 


4.4 A 100-N weight is suspended by ropes as shown 
in Fig. 4-11. Find the tension on each rope. 


FIGURE 4-11 
Problem 4.4. 


4.5 A force of 50 N acting at 37° above the hori- 
zontal pulls a block along the floor with constant 
velocity. If the coefficient of friction between the 


block and the floor is 0.2, what is the mass of the 
block? 
(Answer: 23.4 kg.) 


4.6 A 500-kg box is to be lowered down a ramp at 
constant velocity. The ramp makes an angle of 30° 
with the ground. The coefficient of friction between 
the ramp and the box is 0.7. (a) What force applied 
parallel to the ramp is needed? Must the box be 
pushed down or held back? (b) Repeat (a) if the 
coefficient of friction is 0.2. 


4.7 A constant horizontal force of 50 N acts on a 
body that is resting on a smooth, frictionless hori- 
zontal plane. The body is observed to go from rest 
to v = 5 m/’sec in 10 sec. What is the mass of the 
body? 

(Answer: 100 kg.) 


4.8 A body of mass 5 kg rests on a horizontal fric- 
tionless plane. A force of 10 N is applied at an angle 
of 37° above the plane for 5 sec. How far has the 
body moved in that time? 


4.9 An electron of mass 9 X 107°! kg leaves the 
heated filament of a vacuum tube with vp = 0 m/sec 
and travels in a straight line toward a plate 1 cm 
away. It arrives there with a velocity of 7 x 10° 
m/sec. Find the magnitude of the acceleration and 
the accelerating force. 


4.10 A 5-kg mass is attached to the end of a string 
with a breaking strength of 100 N. What is the max- 
imum acceleration that the mass can be given by 
pulling the string in the upward direction? 
(Answer: 10.2 m/sec?.) 


4.11 Three blocks of mass—m, = 3 kg, m. = 4 kg, 
m3 = 6 kg—resting on a frictionless table and con- 
nected by strings with tensions T, and T> are being 
pulled to the right by a force of 6 N (Fig. 4-12). (a) 
What is the acceleration of the blocks? (b) What are 
the tensions in the strings? 


FIGURE 4-12 
Problem 4.11. 


4.12 What horizontal force is required to drag a 
5-kg block along a horizontal surface, with a coef- 
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ficient of friction of 0.5, at a constant acceleration of 
1 m/sec?? 


4.13 A block of 8 kg is held on an incline at 37°. 
The coefficient of friction between the block and the 
incline is w = 0.1. When the block is released, what 
will be its acceleration? 


4.14 Consider an 8-kg block on a frictionless plane 
inclined at 37° to the horizontal, as in Fig. 4-6a. Sup- 
pose a force of 40 N is applied to the block upward 
along the plane. (a) What will be the acceleration of 
the block? (b) If the upward force applied is 60 N, 
what will be its acceleration? (c) What force is re- 
quired along the plane to hold the block motionless? 
(Answer: (a) 0.90 m/sec? downward, 

(b) 1.60 m/sec? upward, (c) 47.18 N.) 


4.15 (a) What constant force acting parallel to a 37° 
plane is required to push a 10-kg block up the plane 
at constant speed if the coefficient of friction is 0.5? 
(b) What force is required to push it up with an 
acceleration of 2 m/sec’? 


4.16 Block A rests on a frictionless plane and the 
connecting cord passes over a frictionless pulley with 
block B attached to it (Fig. 4-13). What is the accel- 
eration of block A along the plane when it is re- 
leased? 

(Answer: 0.67 m/sec? down the plane.) 


FIGURE 4-13 
Problem 4.16. 


4.17 An object is hung by a rope to the ceiling of 
an elevator. When the elevator rises at constant speed, 
the tension in the rope is 50 N. (a) What is the tension 
when the elevator is accelerating upward at 3 m/sec’? 
(b) What is the acceleration of the elevator if the 
tension is 30 N? 


4.18 An object slides down a 37° incline with con- 
stant velocity. After reaching the bottom, it is launched 
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up the incline with an initial velocity of 5 m/sec. How 
far up the incline will it move before it stops? 
(Answer: 1.06 m.) 


4.19 In Fig. 4-14, the masses of blocks A, B, and C 
are 5 kg, 20 kg, and 10 kg, respectively. The blocks 
are observed to move with constant velocity. What 
will be the acceleration of blocks A and B when block 
C is removed? 

(Answer: 0.65 m/sec?.) 


FIGURE 4-14 
Problem 4.19 


4.20 A 40-N block is connected to a second block 
by a light rope passing over a frictionless pulley, as 
in Fig. 4-15. The coefficient of friction between the 
blocks and the inclines is 0.25. If the 40-N block 
moves up the plane at constant velocity: (a) What is 
the weight of the second block? (b) What is the ten- 
sion in the rope? (c) Suppose that the 40-N block is 
replaced by a 100-N block and the coefficient of fric- 
tion remains unchanged, what will be the acceler- 
ation of the blocks? 


FIGURE 4-15 
Problem 4.20. 


4.21 A rope of breaking strength 800 N is to be 
used to drag a box at constant velocity on a hori- 
zontal surface. The rope pulls the box at some angle 
6 above the horizontal. If the coefficient of friction 
is 0.3, what is the maximum weight of the box that 
can be moved without the rope breaking? 

(Answer: 2784 N.) 


4.22 A 10-kg ball is hung by a rope from the ceiling 
of a car. The maximum tension that the rope can 
withstand is 500 N. (a) What is the maximum hor- 
izontal acceleration that the car can reach without 
the rope breaking? (b) What is the angle between 
the rope and the vertical for that acceleration? 
(Answer: (a) 49.03 m/sec’, (b) 78.7°.) 


4.23 A block is held against the front vertical wall 
of a railroad car, as in Fig. 4-16. The coefficient of 
friction between the block and the wall is 0.4. When 
the train begins to accelerate, the block is released 
and begins to slide down the wall with an acceler- 
ation of 9.0 m/sec”. What is the horizontal acceler- 
ation of the train? 
(Answer: 2 m/sec?.) 
FIGURE 4-16 ee -~ 
Problem 4.23. S23" Seat rares yr tesr 


4.24 In Fig. 4-17, block B of mass 5 kg is being 
pulled by a force F = 100 N. The mass of block A 
is 2 kg and the coefficient of friction between all 
surfaces is 0.2. The pulley is frictionless. Find the 
acceleration of the blocks and the tension in the rope. 


FIGURE 4-17 
Problem 4.24. 


F : 


Work, Energy, 
and Power 
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5.1 INTRODUCTION 


The terms work, energy, and power are common words in English. In physics 
we require precise definitions so that these terms can be formulated mathe- 
matically. Readers will find that definitions in physics do not always match 
the usage of the words. For example, although a student may do homework, 
by the physics definition of work none is being done, although at the biological 
cellular level, chemical work is being done. We will not consider chemical 
work in this book. It is important that we consider mechanical work, energy, 
and power, for it is the treatment of these terms from First Principles that 
will be applied directly to electrical circuits. It is therefore essential that the 
physics definitions of these terms be learned carefully. 


5.2 WORK 


In this first treatment of work we will restrict our consideration to that done 
by a constant force. Our definition of an amount of work AW done by a 
constant force F acting on a body is: The product of the distance the body is 
moved in a given direction by the component of the force in that direction, 


AW = F, As (5.1) 


where F, represents the component of force in the direction As. If F, is 1 N 
and As is 1 m, then the work AW done on the body by the force F is 1 newton- 
meter (N-m). We define a new unit, 1 N-m = 1 joule (pronounced in America 
as jewel) with symbol J. 


Example 5-1 


A box is pushed 3 m at constant velocity across a floor by a force F of 5 N 
parallel to the floor. (a) How much work was done on the box by the force 
F, which clearly opposes friction (see Fig. 5-1). (b) How much work is done 
on the box by the force of friction? 


Solution 


(a) W=5N X¥3m=15]J 
(b) Because a = 0 
ZF, = 0 
F-f=0 
f=5N 


FIGURE 5-1 
Example 5-1. 


The component of f in the direction of the displacement vector Ax is —f 
(because frictional forces always act against the motion); therefore 


AW = (—f) Ax 
= -15J 


Although work may be positive or negative, it has no direction and is 
therefore a scalar quantity. We will now show that work meets the condition 
of the vector dot product of Chapter 2, which was defined as a scalar. Suppose 
the force pulling a box across a floor is not in the direction of motion but is 
in the direction shown in Fig. 5-2. Following the definition of work, we must 
take the component of the force in the direction of motion 


AW = F, Ax 
= Fcos 6 Ax 
AW = F Ax cos 0 (5.2) 
which, by Eq. 2.1, is 
AW = F: Ax (5.3) 


Because Eqs. 5.2 and 5.3 are expressions of the general relation of Eq. 2.1, 
the F and Ax in Eq. 5.2 are the magnitudes, and we thus have a scalar. Eq. 
5.2 has an important conceptual implication. Suppose we carry a weight mg 
across the room. If it is initially placed in our hands and we carry it slowly, 
without appreciable acceleration, the only force we exert is in the upward 
direction. 


2F, = 0 
—mg + F, = 0 
Fy = mg 


If we move a distance Ax, from our definition of work 
AW = F, Ax cos 0 


the angle 6 between the force direction and the motion direction is 90°. There- 


fore we do no work. 
We may use the definition of Eq. 5.2 to treat formally the work of friction 


of Example 5-1. 
AW = f Ax cos 6 
but 6 = 180° and cos 180° = —1, so we could have simply taken the magnitude 


of friction times the distance and the cos @ would have yielded the correct 
sign. 
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FIGURE 5-2 
Force and motion are not in the 
same direction. 


AW = F- Ax 
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5.3 POTENTIAL ENERGY 


Consider now that we lift a weight mg at constant velocity a distance y from 
the floor. Because gravity exerts a downward force mg, we must exert an 
upward force of equal magnitude over a distance y and therefore have done 
an amount of work mgy on the weight. If we lower it back to the floor, we 
still exert an upward force mg, but now the motion is in a direction opposite 
to the force, or 86 = 180°, and the work done by us is —mgy and therefore 
no net work has been done in the round trip. Suppose we wish to move a 
weight mg from the floor to a shelf a distance x across the room. There are 
many paths that we may take, some of which are shown in Fig. 5-3. An 
examination of these paths shows that each move in the y direction contributes 
positive or negative work amounts whose sum must be mgy, whereas motion 
in the x direction contributes no work. We are thus able to draw a very 
important conclusion. Work done against the gravitational force is independent of 
the choice of path between any two fixed endpoints. 

Suppose an object is placed at a height y in a gravitational field, as in 
Fig. 5-4. If it descended from y, the gravitational force mg on the object would 
be capable of doing work equal to the force times the displacement mey. 
Therefore, because the gravitational force is potentially able to do work on 
the object, we say that it has a potential energy E,, equal to mgy. (Note that the 
unit of potential energy is the same as that of work, that is, the joule) 

E, = mgy (5.4) 
That is, if we have lifted it by doing work mgy on it, then it has gained a 
potential energy of mgy, where we use the positive value because the work 
was put into it. Note that there is a direct relation between the work done 
on an object in a gravitational field and its gain in potential energy. The 
measure of y must be considered with care. Suppose an object is lifted above 
a table to a height y;. It has E, = mgy, with respect to the table. But if the 
table is at a height y3 above the floor, the object has E, = mg (y, + y3) with 
respect to the floor (see Fig. 5-5). 

Thus, for potential energy, a reference level must always be specified. If 
we lift an object two different distances above a table, then we may state the 
difference in potential energy between the two positions. If we included the 
distance above the floor for each, then when we take the difference in their 
E,, the distance above the floor will cancel, 


AE, (with the table as the reference level) 


= MZY2 — MEY, = MR(Y2 — Yi) 


AE,, (with the floor as the reference level) 


mg (Y2 + ¥3) — mg(yi + Ys) 
MgY2 — MgYy, = mg(y2 — yi) 


mgy, 


W my, + migys mgy 
rt ~ ie = 


MEY mgy, 4 
mgy 


W mgy, + mgys 4 
" ~ ie 


mgy, + mgy, + 

ie, 6 
FIGURE 5-3 
Examples of different paths used 
in raising a mass to a height y. 


E, = mgy 
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We see in this answer that only the difference in heights needs to be specified 
to give the relative difference in potential energy. We will use this concept 
in electricity to specify the relative difference in potential energy of a charged 
particle in two different positions in an electric field. To apply the concept of 
potential energy in later chapters we will need to expand our consideration 
of work to that done by a variable force. 


5.4 WORK DONE BY A VARIABLE FORCE 


We have seen that when the force acting through a distance Ax is constant, 
then the work done may be written as 


AW = F- Ax = F, Ax (5.3) 


Suppose that at each small displacement of the motion the force has a 
different value. Then we would write Eq. 5.3 as 


W = Fy Ax + Fy2 Axe ar Fy3 Ax3 =f eee Pin Axn 


Or 


(5.5) 


A sketch of this summation is shown in Fig. 5-6 for the work done in moving 
a body from x = a to x = b. We see that each vertical segment of Ax; width 
has associated with it an average F,;. The total work is the sum of the areas 
of these segments. If we make the width of each segment very small so that 
Ax — 0, the number of segments required to obtain the area under the curve 
approaches infinity and the sum of these infinitesimal areas becomes the 
precise area under the curve of Fig. 5-6, or the total work. This is the definition 
of an integral. That is, Eq. 5.5 as Ax — 0 becomes 
b 
W= | Fede (5.6) 
a 
Furthermore, by the definition of an integral, and with reference to Fig. 5-6, 
we see that work is the area under the F, versus the x curve. In the derivation 
of Eq. 5.6 we have assumed that F, although not constant in magnitude, is 
always in the direction of the displacements Ax’s. In the more general case 
where F and the general displacement As’s are not in the same direction, the 


expression for the work becomes 
dW = F -ds (5.7) 


or the integral form 


b 
w= | F - ds 65.7’) 


a 
| 
| 

FIGURE 5-4 
@___ 
i J] ¥ 


Ax, Axo Ax3 Axg 3x5 Ox6 
— 


FIGURE 5-6 
Area under the curve obtained 


by summation of the-area of rec- 
tangles. 
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where the dot product takes care of changes in orientation between the vectors 
F and ds. 


9.9 KINETIC ENERGY 


The word “kinetic” is used frequently in all branches of science and is from 
the Greek word for motion. If work is done on a body that changes its state 
of motion, namely, its velocity, we say that the work has caused the body to 
gain or lose kinetic energy, E,. We will show two derivations of kinetic energy, 
the first for the restricted condition of a constant force and the second for a 
variable force. In both cases we will consider motion in the x direction alone. 

If the force is constant and the initial position is x = 0, we may write 
the definition of work as 


W = F,x 

which from Newton’s second law (Eq. 4.4) can be written as 
W = ma,x 

or, if we take a only in the x direction, we may drop the subscript and write 
W = max (5.8) 


Because the force is constant and we are considering a single body of 
constant mass, the acceleration is constant and we may use Eq. 3.11 


— vg = 2ax (3.11) 


v2 


where vp is the velocity at x = 0 and v is the velocity at x. Substituting this 
equation into Eq. 5.8 we obtain 


u- — VA 
W = m3{—_° 
m{ 2 


1 
W = = mv? — 5 Nv (5.9) 


Thus the work done on a body that changes its velocity actually changes the 


1 | 
quantity 5 mv’, which is called the kinetic energy Ex. 


Ex, = = mv? (5.10) 


Because change in kinetic energy is equal to work and work is a scalar quantity, 
kinetic energy is also a scalar quantity and the unit of kinetic energy is the 
same as that of work, that is, the joule. 


Ey = 
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We obtain the same result if the applied force is not constant but is 
variable. Let us consider the x axis motion again so that 


w= | Fax 


In this case F must remain inside the integral, but we can substitute Newton’s 
second law for it, F = ma 


W=m{ adz (5.11) 


And we may substitute for a from Eq. 3.13 


a=v i 3.13 
dx ea) 
Thus Eq. 5.11 becomes 
W=m . ) a dx 
Xo dx 


W 


(é) 
m | v av 
Yo 


where we have changed the limits of integration because the velocity is vp at 
Xo and v at x. This integrates to 


W 


I 

= 
NIS 
| 
S$ Q 


1 1 
— mv? — = mv (5.9) Ww = 


W 
2 2 


? ? 
mu — = Mv5 
‘, 


— 


rh | — 


which is the same result as found before for a constant force. Note that in 
both derivations we have: used F = ma where F is the net, or resultant, force. 
Ifa = 0, then F = O and there is no net, or resultant, force and hence there 
can be no change in the kinetic energy. Eq. 5.9 is known as the work-energy 
theorem, which states that the work done by the resultant force acting on a particle 
is equal to the change in kinetic energy of the particle. 


5.6 ENERGY CONSERVATION 


We define a mechanically conservative system as one in which no energy enters 
or leaves the system (as heat, radiation, or such). Therefore, the system's 
initial energy is unchanged, which is the same as saying it is conserved. This 
fact simplifies the solution of many types of problems because we do not 
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have to calculate the acceleration. For conservative systems we know that the 
total energy in state 1 is the same as that in state 2. Because the total energy 
is the sum of the potential and kinetic energies, we write 


(Ex + Ep)initiaa = (Ex + Ep)sinas (5.12) 


We can verify explicitly the conservation of the total energy with the 
simple example shown in Fig. 5-7. 

Let us launch an object of mass m from a point y; above the floor with 
an initial velocity v;. Owing to the gravitational force that acts on the object, 
its velocity decreases as it rises. Sometime later, the velocity of the object will 
be v2 and its position y2. We can relate this new velocity and position to the 
initial velocity and position by means of Eq. 3.11. 


v3 — vi = 2(—g)(y2 — yi) 
Rearranging terms, 
vs + 2gy2 = vt + 2gy) 
If we divide both sides by 2 and multiply by the mass of the object m, we 
get 


1 1 
5 nv; + mgy2 = 5 moz + mgy, (5.13) 


Thus, although the velocity of the object (and therefore its E;,) decreases as 
it rises, this decrease in E;, is compensated by an increase in E, in such a way 
that the sum of E, and E, remains constant and equal to the sum of the initial 
kinetic and potential energies. That is, total energy is conserved. 


Example 5-2 


Suppose a ball is dropped from a height h = 10 m. What is its velocity just 
before it strikes the ground? 


Solution 
Eg — Eni = Exg “te Eng 
1 
0+ mgh = > mv + 0 


v= + V2gh = + V2 X 9.8 m/sec? X 10m = —14 m/sec 


where the negative sign is chosen because the motion is downward. 
The pendulum is another simple example of the conservation of energy. 


Let us assume an idealized pendulum that swings in a vacuum so that there 
is no energy lost to air friction and that there is no frictional loss at the pivot 


FIGURE 5-7 


| ] 
mvs + MmgYy2 = 5 muy + my 


| — 


A falling tree illustrates the con- 
version of potential energy into 
kinetic energy. 


(see Fig. 5-8). If we start the pendulum by pulling it to one side and releasing 
it with no initial velocity, it has an initial potential energy of mgho, which is 
also the total energy. If there is no energy loss during its subsequent motion, 
it must always have this amount of total energy. When it is released, it begins 
to fall and potential energy is lost. But as it falls it picks up speed and thereby 
gains kinetic energy. At the bottom of its path, h = 0 and all its energy is 
kinetic. As it starts to rise again, the kinetic energy is converted to potential 
energy. Thus, if ho is its initial height, mgho is the energy of the pendulum 
and the sum of the potential and kinetic energies at all other positions must 
equal this value. We may write this as 


E,0 + Exo = E,o Si Ep 


mghy + 0 


1 
mghy + = mvs 
2 
The string does no work on the pendulum because of the definition (Eq. 5.7) 
dW = F-ds = F cos 0 ds 


The angle @ is that between the string direction and ds, the instantaneous 
direction of motion. This angle is always 90°, so that dW due to the string is 
always zero. 

If the system loses energy or energy is put into the system, it is no longer 
a mechanically conservative system. Later, when we understand other types 
of energy and can enlarge the system to include all sources of input and 
output, we will again develop the concept of conservation of energy. For now, 
however, let us say that all energy must be accounted for and use the term 
accountability of energy. We become accountants and keep books of assets and 
liabilities. All initial energy plus any energy put in, E;,, is on the left side of 
the ledger as an asset. All energy converted to another form or escaping from 
the system, Eouz, may go on the right side. Thus, Eq. 5.12 is written as 


Ex 2 Ey + En = vg ar Ev oF aa (5.14) 


Example 5-3 


A skier is on a 37° slope of length s = 100 m (Fig. 5-9). The coefficient of 
friction between his skis and the snow is 0.2. If he starts from rest, what is 
his velocity at the bottom of the slope? 


Solution 
Ex =F Ee “+ Ein = Eng = Leg F Bc 


No energy is put in, but there is energy lost to work against friction, or 
out = |Weriction| = lf ° s| 
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| 
| 
| 
| 
| 
. | F 
we | - 
ee Oe ek Jy = 0 
FIGURE 5-8 


FIGURE 5-9 
Example 5-3. 
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As before, let us tilt our coordinate axis so that the slope becomes the x 
axis and the normal becomes the y axis. 


2F, = 0 
N — mg cos 37° = 0 
N = mg cos 37° 


f = uN = mme cos 37° 
Eout = |f£-s| = umg cos 37° (s) 
1 
0+ mgh +0 = 5 MOF + 0 + ume cos 37° (s) 
The mass cancels, and we solve for Ur 
v1 = [2(gh — pgs cos 37°)]? 
= [2(9.8 m/sec? x 100 m x sin 37° 


— 0.2 x 9.8 m/sec* x 100 m xX cos 37°)]"” 


= 29.4 m/sec 


5.7 POWER 


Different persons or different machines may take different amounts of time 
to do the same amount of work. The term used to describe this rate of per- 
formance of work is power. 


work done 
Power = ——_ 
time taken 
W 
P= = (5.15) 


Work is measured in joules, time in seconds, therefore the unit of power is 
joules per second (J/sec). We introduce a new unit: 1 J/sec = 1 watt. Con- 
versely, work (or energy) is equal to power x time, 


W(joules) = P(watts) t(sec) 


The symbol used for watt is W. A 100-W light bulb uses 100 J of electrical 
energy each second. Your electric light bill is in kilowatt-hours. A kilowatt- 
hour is the energy dissipated by a device that uses 10° W for a period of 1h, 
that is, 1 kwh = 10? J/sec x 3600 sec = 3.6 X 10° J. | 

We may develop another expression for power because work is defined 
by Eq. 5.3 as the dot product of force F and displacement As, where As can 


be in the x direction. From Kq. 5.15 


F + Ax 
P i eres 
At 
or for infinitesimally small displacements dx 
dx 
P= Fe 
at 
but 
d 
] = v, the velocity 
and therefore an alternative equation for power is 


P=F-v 
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(5.16) P= Bey 


There is an engineering unit of power that we might introduce in passing: 


horsepower. This is defined as 


lhp = 746 W 


Exa mple 5-4 


A tractor can exert a force of 3 x 10* N while moving at a constant speed of 


5 m/sec. What is its horsepower? 
Solution 
P=F:v 
3 x 10*N x 5 msec 


746 W 


PROBLEMS 


5.1 Acar is pulling a trailer with a force of 900 N 
on a level road. The car is moving at 70 km/h. How 
much work is done by the car on the trailer in 15 
min? 


5.2 A 50-kg box is being pushed on a horizontal 
surface, at constant velocity, by a 90-N force acting 
at an angle of 15° below the horizontal. (a) How 
much work is done by the 90-N force in moving the 


1.5 x 105w Gay, = 200 hp 


box a distance of 20 m? (b) How much work is done 
by friction over the same distance? (c) What is the 
force of friction? 


5.3 A force of 20 N parallel to a 37° plane pulls a 
2-kg block 5 m up the plane at a constant speed. (a) 
How much work has been done by the 20-N force? 
(b) How much work has been done by friction? (c) 
How much work has been done by the gravitational 
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force acting on the block? (d) What can you say about 
the total work done? 
(Answer: (a) 100 J, (b) —41 J, (c) —59 J, (d) zero.) 


5.4 In Fig. 5-10 the force acting on a body for x < 
10 m is F, = 0.2 x N and the force for x > 10 m is 
constant at F, = 2 N. What is the work done in 
going from x = 0 tox = 15 m? 


FIGURE 5-10 
Problem 5.4. 


5.5 A boy pulls a 15-kg sled at constant speed a 
distance of 30 m along rough level snow that has a 
coefficient of friction of 0.1. How much work did he 
do? 


5.6 A 40-kg box is to be pushed at constant speed 
a distance of 5 m up a ramp by a force parallel to 
the ramp. The coefficient of friction between the box 
and the ramp is 0.25. The ramp makes an angle of 
37° with the horizontal. How much work must be 
done? 

(Answer: 1571 J.) 


5.7 Acar traveling at 30 m/sec suddenly brakes. If 
the coefficient of friction between the tires and the 
road is 0.7, what is the minimum stopping distance? 
Solve by energy methods. 


5.8 A bead having an initial speed at point A of 
2 m/sec slides down a frictionless wire (see Fig. 5- 
11). What are its speeds at points B and C? 

(Answer: vg = 4.21 m/sec, vc = 3.44 m/sec.) 


A Up = 2 msec 


0.7 m 


FIGURE 5-11 
Problem 5.8. 


|0.3m 
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5.9 A horizontal force of 20 N pulls a 10-kg block 
on a level frictionless surface a distance of 5 m. (a) 
How much work is done, and what becomes of this 
work? (b) Show, using the methods of Chapters 3 


and 4, that the change in the kinetic energy is equal 
to the work done by the force. 


5.10 An automobile is moving with a velocity of 
90 km/h. From what height would it have to fall to 
acquire that velocity? 


5.11 Using the conservation of energy principle, 
find the maximum height reached by a projectile 
launched with a velocity of 80 m/sec at an angle of 
37° with the horizontal? 


5.12 A pendulum consists of a mass at the end of 
a string 1.5 m long. The mass is pulled sideways 
until the string makes an angle of 30° with the ver- 
tical; then it is released. What is the speed of the 
mass as it passes through its lowest point? 
(Answer: 1.98 m/sec.) 


5.13 A light rope passing over a frictionless pulley 
connects two blocks of mass m, = 3 kg and m2 = 
5 kg (see Fig. 5.12). (a) If the blocks are released from 
rest in the position shown in Fig. 5-12, what will be 
the velocity of m2 as it hits the ground? (b) What is 
the final height reached by m,? 

(Answer: (a) 5.42 m/sec, (b) 7.5 m.) 


FIGURE 5-12 
Problem 5.13. 


5.144 A 40-kg box slides 5 m down a ramp inclined 
at 37° to the horizontal. If there is no friction, what 
is its speed at the bottom? If the coefficient of friction 
is 0.2, what is its speed at the bottom? 


5.15 A5-kg block rests at the top of a rough plank 
inclined at 25° with respect to the horizontal and 
4 m long. It is given an initial speed downward of 
2 m/sec, and it just reaches the bottom before it stops. 
What is the coefficient of friction? Solve by energy 
methods. 

(Answer: 0.52.) 


5.16 A constant force of 60 N parallel to an inclined 
plane of 30° above the horizontal pushes a 6-kg block 
10 m up the incline. The coefficient of friction be- 
tween the block and the incline is 0.25. (a) What is 
the velocity of the block at the 10-m point if the block 
starts from rest? (b) If the force is removed at that 
point, how much farther up the incline will the block 
go? (c) At the uppermost point the block starts slid- 
ing down. What is its speed when it reaches the 
bottom? Use the energy methods. 

(Answer: (a) 7.72 m/sec, (b) 4.24 m, (c) 8.90 m/sec.) 


5-17 A rigid rod of negligible weight and length 
1 = 2mhasa mass of 5 kg attached to one end. The 
other end is pivoted about a point 0, as shown in 
Fig. 5-13. The mass is released from the position 
shown with some initial speed vp. As the mass swings 
around it experiences an average frictional force of 
12 N and just reaches the top of the circle, point P. 
(a) What is the initial speed vp of the mass? (b) What 
is the speed of the mass as it passes through the 
lowest point P’? 


FIGURE 5-13 
Problem 5.17. 


5.18 A mass m, = 3 kg resting on a long table is 
connected by a light string passing over a frictionless 
pulley to a second mass m2 = 5 kg hanging 2 m 
above the floor (see Fig. 5-14). The coefficient of 
friction between the table and m, is 0.3. The blocks 
are released from rest. (a) What is the velocity of m2 
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as it hits the floor? (b) What is the total distance 
traveled by m, before it stops? 
(Answer: (a) 4.48 m/sec, (b) 5.42 m.) 


ni 1 


FIGURE 5-14 ms 


Problem 5.18. 


5.19 An 80-kg man ascends a 4 m high staircase in 
12 sec. What is his horsepower? 


5.20 An elevator of mass 800 kg is raised 10 m in 
5 sec. How much power is required? Express your 
answer in watts and in horsepower. 


5.21 Acable car is operated on a slope 1000 m long 
making an angle ot 20° with the horizontal. The cable 
car moves up the slope with a speed of 3 m/sec and 
carries 20 persons of average weight 600 N. What 
power is needed? 

(Answer: 1.23 xX 10+ W.) 


9.22 The piston of a steam engine is driven 120 
times per minute. The length of the stroke is 0.5 m. 
If the engine develops 150 kW of power, what is the 
average force exerted by the steam on the piston? 


5.23 A pump is needed to lift 100 kg of water per 
minute from a well 30 m deep. The water is ejected 
with a speed of 5 m/sec. What must be the power 
output of the pump? 

(Answer: 510.8 W.) 


5.24 A 2500-kg automobile develops 30 kW of power 
to drive with a constant velocity of 90 km/h on a 
level road. What power must it develop to drive up 
a 15° hill with the same velocity? 

(Answer: 1.88 xX 10° W.) 
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6.1 INTRODUCTION 


Momentum is the product of the mass of a body and its velocity. It is therefore 
a vector. We first mentioned momentum in Chapter 4 as the seed whose 
germination led to modern thoughts on the mechanical behavior of newtonian 
bodies. In our description of this development, however, we did not consider 
with sufficient care what was meant by a body. In many cases it is an assembly 
of particles. In this chapter we will first show how such an assembly can be 
mathematically represented by a point mass, called the center of mass. We will 
then show that the motion of the center of mass is that predicted by Newton’s 
second law for a particle whose mass is the sum of the masses of the individual 
particles and is acted on by the resultant of the forces acting on the body. 
Having established these facts, we will turn our attention to the momentum 
changes of colliding bodies with confidence, knowing that the treatment of 
bodies is as mathematically sound as if they were very small masses. Collision 
theory is very important in our later analysis of conduction electrons in solids. 


6.2 CENTER OF MASS 


If you have a stick, whether uniform or not, you can find a point along its 
length that we call the “balance point.” If you place your finger there, you 
can support the stick. Clearly, from Newton’s law, the sum of the forces in 
the y direction is zero at that point, with your finger supplying the upward 
force. The weight of the stick supplies the downward force, and it appears 
to be located at that point, although we know that every segment of the stick 
has weight. We call this point the center of gravity of the stick. . 

If, while balancing the stick on your finger, you suddenly exert an impulse 
on it by moving your finger rapidly upward, the stick will fly upward without 
rotating. That is, all parts of the stick will move upward uniformly and the 
stick will therefore retain the configuration that it had on your finger. If, when 
it is in motion, you catch it at the center of gravity, the entire stick will stop. 
These experiments show that the center of gravity of the stick behaves as a 
point mass in Newton’s second law, F = ma, and that the center of gravity 
may also be considered as the center of mass. That is, if you perform these 
same experiments in distant space where the force of gravity is negligible, 
you will obtain the same results. Similarly, if you have a piece of cardboard, 
you can find a point on the surface at which you can place your finger and 
support it. All the weight of the cardboard can be considered to be located 
at that point. Although it is a more difficult experiment to perform on a three- 
dimensional object, it may be shown mathematically that it too has a center 
of mass. A better understanding of what the center of mass is will be obtained 
when we introduce the concept of torque in Chapter 8. 

Suppose that we have two masses, m, and m2, on a weightless stick as 
in Fig. 6-1a. If we perform a measurement, we will find that the balance point 
(center of mass), is the point at which the products of the masses and their 


(br) 
FIGURE 6-1 


respective distances from the balance point are equal. From Fig. 6-1a, the 
center of mass is the point such that 


ma = Mob (6.1) 


Let us put this figure on an x axis, as in Fig. 6-1b. We may express Eq. 6.1 
in terms of x distances by noting that 


A=Xm— xX and b= x —- Xan 
Equation 6.1 becomes 
My(Xem — X1) = M2(X2 — Xem) 


Rearranging gives 


(my + My)Xem N71 X41 + MoX2 


or 


19,X1 + MoXo 
Xem a ia ire 
Mm, + Mb» 
This relation holds true regardless of the number of masses placed on the 
balance, so we may write 
n 
Mm ;X; 
(6.2) 


Xcm 


=1 

n 

>, mM, 
i=1 


But > m; = M, where M is the total mass. We can therefore express Eq. 6.2 
i=] 
as 


1 
Xem = Fy 2, mix (6.3) 


Example 6-1 


Find the center of mass of the configuration in Fig. 6-2 when m, = 1 kg, 
mM, = 2 kg, and m3; = 3 kg. 


Solution 


1 
cm —— iAj 
x M MX 


If we take the position of m, as the origin, we write this equation as 


1 
aS + 2kg x 0. + 
x feo 2kg x 0.5m + 3 kg X 1.3 m) 


0.82 m 


CENTER OF MASS 


iTl | 
0.5n m 


FIGURE 6-2 
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Example 6-1. 
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If we had chosen any other point as the origin, the position of the center of 
mass relative to the individual masses would have been the same, although 
the numerical value of x... would have been different. 


In the general case, when the masses do not lie on one of the axis, we 
define the center of mass as the point whose cartesian coordinates are 


1. ~ | 
= M & MX; (6.3) 
ee 6.4 
Yom = Mm Yi (6.4) 
ae Shane 6.5 
“on M I~! (6.5) 


where X;, y;, 2; are the coordinates of the ith particle, all measured from the 
same arbitrary origin. The reason for defining the center of mass in this 
manner will become obvious in the next section. 


Example 6-2 

Find the x and y coordinates of the center of mass of the system shown in 
Fig. 6-3, where m, = 2 kg, m. = 3 kg, mz = 4 kg, and m, = 1 kg, and the 
coordinates are (3,4)m, (4,6)m, (5,5)m, and (6,8)m, respectively. 


Solution We note that each mass has both an x and y coordinate and 


therefore each contributes to the x. and the Yom. 
For x. we write 


1” 
Xom = 7, Ss IN;X; 
M i 


= —— (2kg x 3m + 3kg x 4m+4kg xX 5m+1kg X 6m) 
5 


= 4.4m 


For Yen We write 


1 nN 
Yom = M 2 MY; 


1 
ie ey OR ee rae eee ay 


= 54m 


x cn ~ 


Y em ~— 


i 
rn 


FIGURE 6-3 
Example 6-2. 


MOTION OF THE CENTER OF MASS 


6.3 MOTION OF THE CENTER OF MASS 
Let us rewrite the expression for the x coordinate of the center of mass of an 
array of masses, Eq. 6.3, in the following way. 

MxXem = MX, + MpxX2 + ++ * MypXy (6.6) 


Differentiating this with respect to time, we obtain 


AX em Ax, AX dx, 
Ma Sg a ae 
dt * dt * dt dt 
or 
Mi xem = MyVx, + M2Vy2 + °° * MyVey (6.7) 


Similar expressions can be readily obtained for Mv,cm and Mvzcm. 

Equation 6.7 and the equivalent equations for the y and z motion show 
that the total momentum of all the particles is equal to the momentum of a 
single particle whose mass is equal to the sum of the masses of the particles 
and moves with the velocity of the center of mass. Let us now differentiate 
Eq. 6.7 with respect to time. 


On dv. AV x2 AVxn 
M—— =m +m PP, 
dt ‘dt * dt dt 
or 
M Axe = M1 Ay. + Me Aya + °° * My Axy (6.8) 
We can apply Newton’s second law (F = ma) to each individual particle; 
that is, F,. = m14,1, Fy = Moa, ... . Substituting this in Eq. 6.8, we have 


Mie = Fa Fy t !** bax (6.9) 


where F,; is the x component of the resultant (i.e., the sum) of the forces 
acting on the ith particle. A system of masses may be connected by internal 
forces such as the binding forces of a solid. There may be additional external 
forces acting on the solid. By Newton’s third law of action and reaction, each 
force exerted internally by a particle on another has an equal and opposite 
force exerted internally on it. Therefore, the sum of internal forces on the 
right side of Eq. 6.9 must be zero. We conclude that the sum of the forces in 
Eq. 6.9 includes only the external forces acting on the system of particles. We 
can rewrite Eq. 6.9 simply as 


n 
> Pa = Maycm 
i=1 


The same equation can be derived for the y and the z components of the 
external forces, and we have the vector equation of Chapter 4 


F = Magn (6.10) 


F = Maan 
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where F is the resultant of the external forces acting on all the particles. This 
result shows that the center of mass moves as if it were a point whose mass 
is equal to the total mass of the system and all the external forces were acting 
on it. And this is why the point defined by Eqs. 6.3, 6.4, and 6.5 is called the 
center of mass. 


Example 6-3 


Suppose a grenade is thrown that has the trajectory shown in Fig. 6-4. If it 
explodes in midair, only internal forces have acted on the fragments and 
therefore from Eq. 6.10 the acceleration of the center of mass of the fragments, 
regardless of their subsequent dispersal, is unchanged by the explosion, and 
thus follows the original trajectory. 


6.4 MOMENTUM AND ITS CONSERVATION 


Recall Newton’s approach to mechanics from Chapter 4. He said that an 
impulse applied to a body will change its state of momentum (Eq. 4.1). 


F At = Amv (4.1) 
F At = mv; — Mvo 


where vo is the velocity of the body before the force begins to act on it and 
v; is the velocity when the force stops acting on the body. 
Momentum is often represented by the letter p, 


F At = py — Po (4.1’) 


If there is no external force exerted on a mass, the left side of Eq. 4.1’ is zero 
and we may write 


This simple equation is called the law of conservation of momentum. It is im- 
portant to recognize that momentum is a vector and that Eq. 6.11 must be 
satisfied in all three cartesian coordinates. 

We can easily extend this law to a system of particles using the results 
developed in the preceding section. 

If the resultant of the external forces acting on all the particles is zero, 
then from Eq. 6.10 the acceleration of the center of mass is a.,, = 0. Therefore, 
the velocity of the center of mass will be constant. We can then conclude, 
from Eq. 6.7 and the equivalent equations for the y and z directions, that the 
total momentum of all the particles will not change, or 


(Sp) = (SP) (6.12) 
i=1 before i=l after 


We should note that Eq. 6.12 does not imply that the momenta of the indi- 
vidual particles remains constant. The individual momenta can change as a 


rt 


FIGURE 6-4 

A grenade explodes while in a 
trajectory of projectile motion. 
Because there has been no ex- 
ternal force involved in the ex- 
plosion, the motion of the center 
of mass is unchanged. 


ct TEu~t 
(Sp) =(Sp) 
\ 1 before c=] / after 
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result of internal forces such as in Example 6-3, but the total momentum 
remains unchanged. 


FIGURE 6-5 
Example 6-4. 


Example 6-4 


A cannon of mass 1000 kg fires a 100-kg projectile with a muzzle velocity of 
400 m/sec (see Fig. 6-5). With what speed and in what direction does the 
cannon move? 


Solution Let M be the mass of the cannon, Vj its initial velocity, and V, 
its final velocity. Let m be the mass of the projectile and vo and y,; its initial 
and final velocities, respectively. If we consider the cannon and projectile as 
our system of particles, no external force is involved in the firing of the 
projectile and we conclude that | 


Po ~ Pf 
Substitute the terms on each side of this equation 
MVo + MV —= MV; P MV; 


If we choose the direction of motion of the projectile as the positive direction, 
and noting that Vp and vo are zero, we get 


0+ 0 = mv, + MV; 
or 
Vee eee 
M 1000 kg 
V; = — 40 m/sec 


Note that although from experience we know that the cannon will recoil (i.e., 
move in a direction opposite to that of the projectile), we are not told this as 
one of the facts of the problem. So we put V; in as positive; the vector aspect 
of the formulation shows in the result (i.e., the fact that V; is negative) that 
the direction of recoil is opposite to that of the projectile. 


In the next section we will consider problems in collisions whose solutions 
involve both momentum conservation and energy accountability. However, 
as the following example illustrates, some questions about collisions can be 
answered by momentum conservation alone. 


Example 6-5 


A 10,000-kg truck traveling east at 20 m/sec collides with a 2000-kg car traveling 
north at 30 m/sec. After the collision, they are locked together. With what S 
velocity and at what angle do the locked vehicles move immediately after the FIGURE 6-6 
collision? (See the schematic diagram, Fig. 6-6.) Example 6-5. 
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. i : ] e e . cy ° 
20!ulI0ON Because no external force is involved in the collision, momen- 
tum is conserved. In the x direction 
Pxo = Pxf 


MyVr = (mr + m,)V cos 0 
Rearranging terms, 


NyUT _ 10,000 ke x 20 m/sec 


"00s 0 = +m. 10,000 kg + 2000 ke 
= 16.7 m/sec 
In the y direction, 
Pyo — Pyf 


MV, = (mr + m,)V sin 6 
Solving for V sin 6, 


MV; 2000 kg x 30 m/sec 

SIO ee Se 

mr +m, 10,000 kg + 2000 kg 
= 5.0 m/sec 

First find the angle by dividing the two velocity components 


Y sin 0 5.0 m/sec 
—=—— = tan 0 = ———— 
Y cos 0 16.7 m/sec 


6 = arctan 0.30 = 16.7° 


= 0.30 


Find V by substituting the angle into either the x or y momentum solutions 
V sin 16.7° = 5 m/sec 
V = 17.4 m/sec 
or | 
V cos 16.7° = 16.7 m/sec 
V = 17.4 m/sec 


6.5 COLLISIONS 


One of the most important applications of the conservation of momentum 
law occurs in the theory of collisions. We will deal only with collisions between 
two bodies, as it is exceedingly difficult to obtain any but approximate so- 
lutions for three-body collisions. There are two types of collisions, to which 
we give the names elastic and inelastic. In an elastic collision kinetic energy is 


conserved (i.e., no energy is lost from the system). This type of collision can 
occur only between atomic particles, although in physics problems we often 
assume elastic collisions between colliding bodies. In actuality, there are no 
elastic collisions, but in some the energy loss is very small and it may be 
considered negligible. An inelastic collision is one in which kinetic energy is 
not conserved (e.g., some energy is lost to friction, crumpled fenders, or 
such). 


6-5a Elastic Collisions 
Example 6-6 


A neutron with a mass of m = 1 u (atomic mass unit) strikes a larger atom 
at rest and rebounds elastically along its original path with 0.9 of its initial 
forward velocity. What is the mass M, in atomic mass units, of the atom it 
struck? 


Solution Let vp be the initial velocity of the neutron and uy; = —0.9 vp 
its final velocity. Note that the problem tells us that it rebounds; therefore 
the direction of the final velocity is opposite to its initial velocity. Let M be 
the mass of the atom, Vo its initial velocity, and V; its velocity after collision. 
Both momentum and kinetic energy are conserved. From the conservation of 
momentum 


MUg + MV = MoUs = MV; 


and on rearranging and using the fact that Vo = 0 


m(Vo — Up) 
a 
/ M 
_ Lud + 0.9 9) — (1 u)(1.9 vp) 
7 M M 
From the conservation of kinetic energy 
1 1 1 1 
5 Nv + 5 MV5 = 5 oF = 5 MV; 


Solving for M, noting that Vo = 0, we obtain 
m(vo — vf) _ (1 u)(0.19 v9) 
Vi Vi 
If we substitute for V; from the momentum equation into the energy equation, 
we obtain 


M = 


_ (1 u)(0.19 v8) M? 
(1 u2)(3.61 v2) 
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Multiflash photograph of a col- 
lision between a moving ball 
coming in from the left and a sta- 
tionary ball. 
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which simplifies to 


_ (1 u2)(3.61) 
~ (1 u) (0.19) 


= 19u 


Example 6-7 


An important type of elastic collision at the atomic level, whose results we 
will use later, is the collision between a very small mass particle, such as an 
electron, with another particle of comparatively large mass, such as an atom. 
The mass of a copper atom, for example, is about 10° times that of an electron. 
In this type of collision one is often interested in finding the velocity of the 
electron after the collision with the copper atom. To solve this type of collision, 
we follow the usual procedure of conserving momentum and kinetic energy. 
We will assume a one-dimensional collision. 


Solution Letm, vo, and v, be the mass and the initial and the final velocity 
of the electron and M, Vo, and V; those of the atom. From the conservation 
of momentum law 


mvyg + MVo = mu, + MV; 
and, on rearranging, 
M(Vo — Vs) = mvs — Vo) (6.13) 


Conserving kinetic energy yields 


= mos + 5 MV3 = = mop + > MV} 
or 
M(V3 — V?) = m(v? — v9) 
On factoring, 
M(Vo + VA(Vo — Vp) = m(v¢ + Vo)(U¢ — Vo) (6.14) 
Dividing Eq. 6.14 by Eq. 6.13 obtains 
Vo + Vz = U0 + YF (6.15) 


For simplicity, let us have the atom initially at rest, Vo = 0, and substitute 
V; of Eq. 6.15 into Eq. 6.13, 


—M (vp + v0) = m (UF — V) 


—Mvo — Mou; = mu; — mv 


Rearranging, 
—v,(M + m) = vo(M — m) 


__ (M = m) 


OME + mm) 


(6.16) 


For our case m << M,; therefore Eq. 6.16 reduces to 
Uf ~ — Uo (6.17) 


The electron rebounds (recoils) with the same magnitude of velocity; thus 
it does not lose any kinetic energy; therefore the atom does not gain any and 
is not set in motion. This is, of course, an approximation. If the mass of the 
electron is taken as 1 unit and the mass of the atom as 10° units and these 
numbers are substituted into Eq. 6.16, then vy = — 0.99998 vp. Because kinetic 
energy is proportional to v*, the remaining energy of the recoiling electron is 
0.99996 of its initial kinetic energy. Therefore, in the collision 0.00004 of the 
initial kinetic energy of the electron has been transferred to the atom. This 
concept will be important later when we develop the loss of electron energy 
to atoms in an electrical conductor in which electrons flow as a current. The 
increase in energy of the atoms from electron collisions manifests itself as an 
increase in temperature of the conductor. 


6.5b Inelastic Collisions 


In all the preceding examples, the energy of the system was unchanged by 
the collision. We now give an example of a collision in which the energy is 
changed by the collision. | 


Example 6-8 


A ballistic pendulum is used to measure the velocity of a bullet. The bullet 
is shot into a wooden block suspended by strings. It lodges in the block, 
losing energy in its penetration, and the increase in the height of the swinging 
block and bullet is measured (see Fig. 6-7). If the bullet has a mass of 0.01 
kg, the block has a mass of 0.5 kg, and the swing rises 0.1 m, what was the 
velocity of the incident bullet and what fraction of its energy was lost during 
penetration? 


Solution We first conserve momentum between situation (a) and (b) in 
Fig. 6-7. 
Po = Py 
mv +0 = (m+ M)V 


0.01 kg v = 0.51 kg V 


(0) 


(0) 
FIGURE 6-7 
Example 6-8. 
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We saw in Section 5.6 in the discussion of the pendulum that the string does 
no work on the block. We may therefore conserve mechanical energy between 
situations (b) and (c) in Fig. 6-7. 


(Ex), = (E,). 
= (mn + M)V? = (m + M)gh 


V = V2gh = 1.4 m/sec 
Substitute this value into the momentum equation and obtain 


_ 0.51 kg 
~ 0.01 keg 
We find the fraction of the bullet’s initial energy lost in penetration by cal- 


culating the energy of the system before (situation a) and after the collision 
(situation b). 


x 1.4 m/sec = 71.4 m/sec 


V0 


(Ex = x mv* = = (0.01 kg)(71.4 m/sec)? = 25.5 J 


Nie Nie 


(Exy = =(m + M)V2 = = (0.51 kg)(1.4 m/sec)? = 0.5 J 


The fraction remaining is 


(Exde 


0.5 
Fraction remaining = (E,) = a = 0.02 
kJa . 


and the fraction lost is 
Fraction lost = 1 — fraction remaining = 0.98 
When colliding objects stick together we find that the kinetic energy is 


not conserved. We should note that energy can be lost, and therefore the 
collision is inelastic, in certain cases where objects do not stick together. 


PROBLEMS 

6-1 The equilibrium separation between the cen- proton (m = 1.67 X 10~’’ kg) in a circular orbit of 
ters of the sodium (m = 23 u) and chlorine (m = radius r = 0.5 X 107!° m. Where is the center of 
35 u) ions in the sodium chloride molecule is 2.4 x mass of the hydrogen atom? 


be | lecule? 
10° m. Where is the center of mass of the molecule: 6.3. What are the x and y coordinates of the center 


6.2 In the Bohr model of the hydrogen atom, the of mass of the system of particles shown in Fig. 
electron (m = 9.1 x 107°! kg) revolves around a 6-8? 


FIGURE 6-8 
Problem 6.3. 


xn) —- 


6.4 A thorium nucleus (m = 232 u), at rest at the 
origin of a coordinate system, decays into a radium 
nucleus (m = 228 u) and an alpha particle (m = 
4 u). Sometime later, the alpha particle passes the 
point x = 3m, y = 2mwitha velocity v = 2 x 10° 
m/sec. What is the position and the velocity of the 
radium nucleus at that moment? 

(Answer: x = 5.3 X 10°? m, y = 3.5 X 1077 m, 

v = 3.5 X 10* m/sec.) 


6.5 Two particles of mass m, = 1 kg and m, = 
99 kg are held 2 m apart. The particles attract each 
other with a constant force directed along the line 
joining the two particles. (a) When the particles are 
released, where will the collision occur? (b) Does the 
answer to (a) depend on the actual value of the force? 

(Answer: (a) 1.98 m from my, (b) no.) 


6.6 A0.25-kg baseball has an initial velocity toward 
a bat of 15 m/sec. The batter strikes the ball and it 
goes out in the reverse direction at 30 m/sec. (a) What 
is the change in the momentum of the ball? (b) What 
is its change in kinetic energy? 


6.7 A swimmer in a pool makes a racing turn at 
the end by suddenly straightening his legs while his 
feet are pressed against the end of the pool. If his 
mass is 80 kg and he exerts an average force of 
120 N for 0.8 sec, what is his initial velocity on leav- 
ing the pool end? 


6.8 Acar crashes into a tree. If the car has a mass 
of 1200 kg and its speed is reduced from 30 m/sec 
to zero in 0.2 sec, what is the average force exerted 
by the tree on the car? 


6.9 A fire hose delivers water at the rate of 20 kg/sec 
with a speed of 25 m/sec. A riot police officer uses 
the hose to control an unruly crowd. The water from 
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the hose strikes a person horizontally and then falls 
down to the ground. What is the average force ex- 
perienced by that person? 


(Answer: 500 N.) 


6.10 A gun fires a 0.01-kg bullet with a velocity of 
250 m/sec at a 0.5-kg melon resting on a post. The 
bullet penetrates the melon and leaves the back of 
it with a velocity of 100 m/sec. With what velocity 
and in what direction does the melon leave the post? 


6.11 A radium atom at rest with a mass of 226 u 
suddenly emits an alpha particle of mass 4 u with a 
speed of 2 x 10’ m/sec. With what speed and in 
what direction does the resulting radon atom of mass 
222 u move? 


6.12 A truck of mass5 x 10% kg moving at 20 m/sec 
collides head-on with a car of mass 1 x 10° kg mov- 
ing at 25 m/sec in the opposite direction. If they stick 
together after the collision, in what direction and 
with what speed do they move immediately after 
the collision? 


6.13 Anatom of mass 10 u strikes a stationary atom 
of mass M and rebounds elastically with one half its 
Original velocity. What is the mass of the atom it 
struck? 


(Answer: 30 u.) 


6.14 A sled of mass 10 kg slides on level, friction- 
less ice with a velocity of 12 m/sec. It collides elas- 
tically with another sled of different mass pointed 
in the same direction but at rest. After the collision, 
the first sled continues in the same direction but with 
a velocity of 4 m/sec. What is the mass of the second 
sled and its velocity after the collision? 

(Answer: 5 kg, 16 m/sec.) 


6.15 A 9000-N open-top railroad car is coasting with 
a velocity v = 10 km/h on a frictionless horizontal 
track. A 1200-kg meteorite falls vertically into the car 
with a velocity of 200 km/h. (a) What is the velocity 
of the railroad car after the meteorite lands on it? (b) 
What is the magnitude and the direction of the im- 
pulse of the meteorite on the car? 


6.16 An object at rest in space explodes into three 
equal parts. The velocities of two of them are, re- 


80 MOMENTUM AND COLLISIONS 


spectively, 2i and —4j. Find the resulting velocity of 
the third part. 


6.17 A bullet of mass 80 g is moving east with a 
velocity vp. The bullet strikes a 200-g wooden block 
moving south with a velocity of 2 m/sec. The bullet 
remains embedded in the block, which then moves 
in the direction 37° south of east. (a) What is the 
velocity of the block after the collision? (b) What was 
the initial velocity vo of the bullet? (c) What is the 
fractional change in the energy of the system? 
(Answer: (a) 2.37 m/sec, (b) 6.65 m/sec, 
(c) 64% decrease.) 


6.18 Two particles of mass m, = 5 kg and m, = 
2 kg move toward each other as shown in Fig. 6-9. 
After the collision, they stick together. (a) What is 
the speed of the particles after they collide? (b) What 
is the direction of motion of the particles after the 
collision? (c) What is the change in the total kinetic 
energy of the particles? 


i LO m/sec 
— ; $f 
Us Le msec 


FIGURE 6-9 
Problem 6.18. 


6.19 A puck sliding on a frictionless table with a 
velocity v = 2 m/sec strikes a second puck of equal 
mass initially at rest. The collision is elastic, and it 
is found that after the collision both pucks move 
with the same speed. (a) What is the speed of the 
pucks after the collision? (b) What is the angle be- 
tween the directions of motion of the pucks? 

(Answer: (a) 1.41 m/sec, (b) 90°.) 


6.29 Three boys stand on a 10-kg wagon resting on 
a frictionless horizontal surface. The boys take turns 


running off the same end of the wagon with a ve- 
locity of 1.5 m/sec relative to the wagon. The mass 
of each of the boys is 40 kg. What is the final velocity 
of the wagon? 

(Answer: 7.87 m/sec.) 


6.21. A 2-kg block rests on the ground. The coef- 
ficient of friction between the block and the ground 
is 0.4. A man fires a 0.01-kg bullet parallel to the 
ground. It lodges in the block, and the block and 
bullet are observed to slide 2 m before coming to 
rest. What was the velocity of the bullet? 

(Answer: 796 m/sec.) 


6.22 A block of mass 1 kg rests over a hole in a 
tabletop. A bullet of mass 0.01 kg is fired upward 
into the block with a velocity of 200 m/sec. If the 
bullet imbeds itself in the block, how high will the 
block rise? 


6.23 Apbullet of mass 100 g is shot into a 3-kg wooden 
block resting on an incline plane, as shown in Fig. 
6-10. The bullet remains embedded in the block, which 


_ then slides down the incline plane 2 m before coming 


to rest. The coefficient of friction between the block 
and the incline is 0.5. What was the initial velocity 
of the bullet? 

(Answer: 91.9 m/sec.) 


FIGURE 6-10 
Problem 6.23. 


6.24 A bomb is launched with a velocity vp = 
500 m/sec at an angle of 37° with the horizontal. At 
the highest point of the trajectory it explodes in two 
equal pieces. One piece lands 20 sec later directly 
below the point where the explosion occurred. When 
and where does the second piece land? 

(Answer: 77.8 sec, 4.99 x 10* m from launching.) 
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7.1 INTRODUCTION 


In the preceding chapters we dealt with translational motion—that is, the 
change of position in the cartesian coordinate system of the center of mass 
of a body. However, some systems simply rotate and some rotate while the 
center of mass translates through space: A roulette wheel simply rotates, 
whereas a car wheel both rotates and translates. Newton’s laws and mo- 
mentum and energy conservation still apply, but the formulation is somewhat 
different. In this chapter and the next we will consider rotational motion. We 
need these properties in order to build the model of the electron rotational 
motion in atoms. 


7.2 MEASUREMENT OF ROTATION 


The most common measurement of rotation is a count of the number of 
revolutions about an axis of rotation. We also use degrees as a measure, 
where 360° corresponds to one revolution. In physics we mostly use radians 
for a variety of reasons. One of these reasons is that the formulation affords 
a quick and easy bridge between linear and rotational motion. Let us examine 
this. | 

A measure of an angle in radians is the length of the circular arc subtended 
by the angle divided by the radius of the circle (see Fig. 7-1). If the length of 
the arc from a to b is s and r is the radius, then the measure of angle @ in 
radians is given by | 


@ (in radians) = (7.1) 


“= 1D 


Because both s and r¢ are in units of length, the units cancel on the right 
side and @ is dimensionless. Other measures of 0, such as degrees or revo- 
lutions, are also dimensionless. However, the numerical magnitudes of these 
quantities differ, so we must state the system of measure used. Obviously, 
we must maintain a consistency of angular measure in a given problem. 

Returning to Eq. 7.1, we may ask how many radians there are in a whole 
revolution. The arc length subtended by a revolution is the circumference, or 
27r. Therefore, 


s of 1 revolution 2ur ; 
= — = 27 radians (rad) 
radius r 


We may convert between systems of angular measure as shown in Chapter 
1 using the identities 


2a (radians) = 360 (degrees) = 1 revolution (rev) 


FIGURE 7-1 
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7.3 ROTATIONAL MOTION 


Suppose we have a reference marker a on the x axis of a coordinate system 
and a wheel whose center coincides with the origin. We also have a mark b 
on the wheel. We can measure the time during which the wheel marker b 
moves from the coordinate marker a, a distance As (see Fig. 7-2). The 
speed,_., of the marker on the wheel is measured by the time it takes for the 
marker to move an arc length As, or 


speed, ., = ~ 
In the limit At — 0 the distance As becomes a vector and the speed becomes _‘ FIGURE 7-2 
v, the instantaneous velocity (see Section 3.2) 
_ ds 
ake? (7.2) 

The direction of the instantaneous velocity of the marker on the rotating wheel 
is the tangent to the circle of motion and is called the tangential velocity; it is 
sometimes indicated by writing v with the subscript T. Note that as the marker 
rotates the direction of vr constantly changes even though the marker may 
rotate at a constant rate. Therefore, the vector v; is constantly changing. In 
the previous chapters we have dealt largely with vectors whose direction 
remained constant while the magnitude changed, whereas here we have a 
vector whose magnitude may remain constant while its direction always 
changes. This has important implications in the development of the centripetal 
force that we will consider later in this chapter. 

In Fig. 7-2 we see that while s is increasing, the angle 0 is also increasing 
as the moving radius vector (line from origin to the marker on the circum- 
ference) sweeps out a larger arc. The average rate of change of the angle 6 
with time is called the average angular or rotational velocity; we use the small 
Greek letter w (omega) for this. 


— Aé6 
Oo = — 
At 
and, as At— 0, the average angular velocity w becomes the instantaneous 
angular velocity w, namely, 


vo (7.3) i= 


Because @ is dimensionless, w has units of reciprocal time, although it must 
be specified whether w is radians/second, degrees/second or revolu- 
tions/second. We may relate the tangential velocity to the rotational velocity 
by differentiating Eq. 7.1 with respect to time 
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Rearranging terms and using the definitions of Eqs. 7.2 and 7.3 we obtain 
Or = TOW (7.4) vy = Tw 


where w is in radians/second. 


Example 7-1 


A car is traveling at a constant velocity of 24 m/sec. The radius of its wheels 
is r = 0.30 m. (a) How many revolutions have the wheels turned after the 
car has gone 120 m? (b) How many revolutions have the wheels turned after 
60 sec? 


Solution 


(a) If there is no slipping between the wheels of the car and the road, 
the arc length moved by a marker on the outermost radius of the 
wheel is equal to the distance traveled by the car; that is, s = 120 m. 


Using Eq. 7.1 
Bie S 120 m 
 ¢ 0.30 m 
1 
6 = 400 rad = 400 rad ( a) = O97 79 
27 rad 


(b) Because the car travels at constant velocity, the distance traveled by 
the car in 60 sec can be found with Eq. 3.12, keeping in mind that 
the acceleration a = 0 


x = (24 m/sec)(60 sec) = 1440 m 


This, as we have indicated, is also the arc length moved by a marker on 
the rim of the wheel. We now use Eq. 7.1 to find the angle rotated by 
the wheel in 60 sec. 


0 (ft = 60 sec) =- 


1 rev 


9 = 1440 m 
7 27 rad 


0.30 m 


= 4800 rad = 4800 rad ( ) = 764 rev 


Suppose the marker on the rotating wheel of Fig. 7-2 is not rotating at a 
constant rate but is speeding up or slowing down. Then from Eq. 7.4 the 
marker on the wheel has an average tangential acceleration that from Eq. 3.4 
is 
el 

At 


Vif — VrTo 


At 
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In the limit as At — 0, ar becomes the instantaneous acceleration a,, that is, 
(7.5) 
The rate at which the angle 0 is being swept out, the angular velocity w, 


is also changing. We call the rate of change of w the average angular or rotational 
acceleration and use as the symbol the small Greek letter a (alpha), so that 


5 = fe 
~~ At 
To find the instantaneous value we let At — 0, and 
dw eas dw 
a => dt (7.6) c= dt 


Angular acceleration has dimensions of (time) ~*, although, as before, we 
must specify the measure of the angle. We may connect the angular accel- 
eration with the tangential acceleration of the marker on the wheel by dif- 
ferentiating Eq. 7.4 with respect to time. 


Ls a 
dt — dt 
ar = ra (7.7) a; = ra 


where a is in radians/second squared. 

Because the radian is a dimensionless quantity, the units of a; will be 
the same as those of r divided by second’. Thus if r is expressed in meters, 
ar will be in m/sec’. 


Example 7-2 


A driver of a car traveling at 24 m/sec applies the brakes, decelerates uniformly, 
and comes to a stop in 100 m. If the wheels have a radius of 0.30 m, what is 
the angular deceleration of the wheels in rev/sec’? 


Solution” There are several ways to solve this, but let us use the most 
straightforward one, finding first the linear deceleration and then relating it 
to rotational deceleration 


Vo = 24 m/sec, uy» = 0, x = 100m, a=? 
From Eq. 3.11, 
Up — UO = 2ax 
ve — v9 — 0 — (24 m/sec)? _ 


ws Oe ee, es EO cee 
. Ox (2)(100 m) meal 


Because a is the acceleration of the car, it is also the tangential acceleration 
of every point on the rim of its wheels (assuming no slipping between the 
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wheels and the road). By Eq. 7.7 


ar = ra 
_ ar _ —2.88 m/sec? 
> 0.30 m 
rad 1 rev 
a= - 6 an aa — 1.5 rev/sec? 


7.4 EQUATIONS OF ROTATIONAL MOTION 


We may derive the equations of rotational motion by the method of Chapter 
3. As in Chapter 3, we will limit our discussion to the case of constant angular 


acceleration. 
The basic relations obtained by arguments analogous to those of Eqs. 3.7 


and 3.9 are 
6 = oat (7.8) 


_ Wo + OF 

a (7.9) 
We may obtain the other three relations analogous to Eqs. 3.8, 3.11, and 3.12 
by integration for the condition that a = constant. 

From the definition of a, Eq. 7.6, 

_ dw 

dt 


W = W + at (7.10) 
From the definition 
_ dé 
Ut 


Substitute Eq. 7.10 for w 


ei] t 
J ao= | oat 
8 0 
6 
dé 


t t t 
J, (eo + atydt = 0 f dt +a f tat 
8 0 0 0 


1 
6 — Ao = Wot = 2 at? (7.11) 


d= wt 

= Mg + cu; 

y= J 
5 

iW = W@W + al 


L 
7 = Ao + Wol + 5 at 


— 
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To obtain our final equation, we use the chain rule to write 
dw dwdé dw 
dt =dodt “do 
Multiply both sides by dé 
ad@ = wdw 


and the integration is 


6 @ 
a d@ = | w dw 
% Wo 
1 
a(@ — @) = 5 (wo — @) 


which is usually written in the form 
w* — we = 2a(0 — 4) (7.12) w* = a + 2a(@ — Gp) 


Example 7-3 


A roulette wheel is given an initial rotational velocity of 2 rev/sec. It is observed 
to be rotating at 1.5 rev/sec 5 sec after it was set in motion. (a) What is the 
angular deceleration (assumed constant) of the wheel? (b) How long will it 
take to stop? (c) How many revolutions will it make from start to finish? 


Solution 
(a) @ = 2.0 rev/sec w = 1.5 rev/sec t =5sec a=? 
W@W = W + at 


@— @ _ 1.5 rev/sec — 2.0 rev/sec 
t 7 5 sec 


a = —0.1 rev/sec? 
(b) w) = 2rev/sec w= 0 a@ = —O0.1rev/sec? t = ? 
Wr = Wo + at 


@ — @  O — 2 rev/sec 


ae 
II 


a ~ —0.1 rev/sec? 
Ey = 20 sec 
(Cc) @ = 2rev/sec wa = 0 t = 20sec 0=? 
Wy +a, — 2revi/sec + 0 
a a an ne ne 


6 = 20 rev 
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7.5 RADIAL ACCELERATION 


Let us consider more carefully the motion of the marker on the wheel in Fig. 
7-2 as the wheel rotates at constant speed. In Fig. 7-3a we have the same 
wheel with the velocity vectors indicated at points a and b. We see that even 
though the velocity vectors at points a and b may have the same magnitude, 
their direction is different. This difference is indicated in Fig. 7-3b by the 
vector Av,. In this figure the vector tails have been put at a common point. 
Thus, in a time At the vector v, has changed in value by Av,. This implies 
an acceleration has taken place. Because a velocity vector of a point on a circle 
is always tangent to the circle, it is perpendicular to the radius. For infini- 
tesimal changes At and thus Av,, there is an acceleration ag inward along 
the radius called a radial acceleration ag given by 


are = lim ava 
R Arso Ate0 


We will now examine this radial acceleration analytically. We see from 
Fig. 7-4a by the method of vector components of Chapter 2 that the coordinates 
of the marker at some time t are 


x = rcosé@ 
(7.13) 


y =rsin@ 
Let the marker rotate about the circle at a constant rotational velocity w so 
that w@ = w. Substitute Eq. 7.8 into Eqs. 7.13 and obtain 


x = rcos at 
(7.14) 


y =rsin at 
The x component of the velocity of the marker in Fig. 7-4b is v, = dx/dt, and 
the y component is v, = dy/dt. Performing this differentiation of Eqs. 7.14 
yields 


d . 
Vv, = 7 at (cos wt) = —rw sin ot 


(7.15) 


d_. 
vy =FT it (sin wt) = rw cos wt 


We see that in Eqs. 7.15 both v, and v, are functions of time and therefore 
the point must be accelerating in both the x and y directions. We may obtain 
the components of acceleration by differentiating Eqs. 7.15 with respect to 
time: 


dv,. ee d e == 2? 
a, = 7 rw Ak (sin wt) = —rw* cos wt 
(7.16) 
dv d : 
, = it = 1a (cos wt) = —re? sin wt 


(0) 
FIGURE 7-3 


(0) 
FIGURE 7-4 


The square of the resultant acceleration az is the sum of the squares of the 
components, or 


aR = az + ay 


ax = r*w'cos*wt + 1r’w'tsin2t 


r?w*(cos*wt + sin?wt) 
Using the trigonometric identity that 


sin?@ + cos*é@ = 1 


we obtain 


or 


Op = tr? (7.17) 


The direction of ag can be found by comparing Eq. 7.16 with Eq. 7.14. It is 
seen that a, is w* times the negative x coordinate of the radius vector r and a, 
is w* times the negative y coordinate of r. This implies that az = —w’ r, and, 
consequently, the direction of az is along the radius toward the center, that 
is, opposite to the vector direction of the radius. This is sketched in Fig. 7-5, 
where the resultant az is seen to be directed inward along the radius toward 
the center. 


7.6 CENTRIPETAL FORCE 


Newton’s second law, F = ma, states that if there is a net force on a body 
there is an associated acceleration. The converse is true; if there is an accel- 
eration there must be a net force. We have shown in the previous section 
that a particle or a body moving in circular motion at constant speed is being 
accelerated inward along the radius. Therefore, the particle must be acted on 
by a force along the radius toward the center. This situation corresponds to 
the statement of Newton's second law that “If a body in a state of motion is 
acted on by an external force, it will be accelerated in the direction of the 
force.”” The particle cannot undergo circular motion unless there is a force 
along the radius directed inward toward the center. This force is called the 
centripetal (center-seeking) force. A demonstration of this is easily performed 
by whirling a weight at the end of a string in a circle. You must exert a 
constant force (tension in the string) to maintain the motion. If you let go of 
the string, the weight will fly off in a straight line with a velocity whose 
direction will be the tangent to the circle at the point of release. 

We indicate radial (or centripetal) force by Fr. We may use Newton’s 
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FIGURE 7-5 


ap = 


ra 
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second law to write 


Fe = Mar (7.18) 
or, using Eq. 7.17 
XFR = mro* (7.19) 
Another convenient form is obtained by substituting Eq. 7.4, vr = rw for w 
tie = mat (7.20) SFR =m z 


In the solution of problems involving radial acceleration, two rules must 
be observed, based on the derivations: (1) ag has dimensions of m/sec? and 
therefore w must have dimensions of rad/sec?; and (2) radial forces directed 
toward the center of rotation are positive, whereas those directed away from 
the center are negative. We also note from Section 3-2 that the magnitude of 
the instantaneous tangential velocity at any point is equal to the speed. 


Example 7-4 


A person whose weight is 600 N is riding a roller coaster. This person sits 
on a scale as the roller coaster passes over the top of a rise of radius 80 m. 
(a) What is the minimum speed of the car if the scale reads zero (the sensation 
of weightlessness is experienced)? (b) If the car increases its speed to 40 m/sec 
in descending to a dip with a radius of 80 m, what will the scale read? See 
Fig. 7-6. 


Solution Let us consider the forces on the rider at the rise. The rider’s 
weight, mg, is directed toward the center. The scale exerts a normal force N 
upward. N is the reading of the scale. | 


2 

OT 

LFe = m— 

T 

po 2 
UT 

mg -N=m— 


If the scale reads zero, N = 0 and 


UT 
me = m-— 
8 “7 
vr = Ver 


= V9.8 m/sec? X 80 m = 28 m/sec 


In the dip vr = 40 m/sec, mg is downward directed away from the center FIGURE 7-6 
whereas N’ now is upward toward the center. Following our sign convention Example 7-4. 
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we write 
—-mg + N = m— 


vr 


N' = mg + m—~ 


2 
600 N + ( 600 N er m/sec) 


9.8 m/sec? (80 m) 
600 N + 1224N = 1824N 


Notice that now the scale reads more than three times the person’s weight. 


7.7 ORBITAL MOTION AND GRAVITATION 


Johannes Kepler (1571-1630), a German astronomer and mathematician, plot- 
ted the orbits followed by the planets around the sun. He found three em- 
pirical rules for planetary motion, the first two were published in 1609 and 
the third in 1621. The reason for planetary behavior was not known until 
Newton found that he could derive Kepler’s rules if he postulated a universal 
gravitational law. Namely, any two bodies are gravitationally attracted to each 
other by a force proportional to the product of their masses (m,m 2) and in- 
versely proportional to the square of the distance between them, r?. If we 
call the proportionality constant G, the universal gravitational constant, we may 
write 

Mm,M 
2 


F=G (7.21) 


‘3 
The value of this constant is G = 6.67 x 107"! Nm?/kg?. Such a small number 
was not measurable in Newton’s time, and it was first measured in 1798 by 
Henry Cavendish (1731-1810). Newton found a method of performing cal- 
culations without it. For example, he was able to calculate the acceleration of 
gravity at the earth’s surface, g = 9.8 m/sec*, which compared favorably with 
the experimental measurement of the acceleration of falling bodies. He rea- 
soned as follows. Let m, be the mass of the earth, m, the mass of an object 
near the surface of the earth, m,, the mass of the moon, r. the radius of the 
earth, and r,,, the distance from the center of mass of the earth to the center 
of mass of the moon (assume constant radius of the moon’s orbit). 

At the surface of the earth the force on an object is its weight m,g. This 
is equal to the force of gravity between the object and the earth, as given by 
Eq. 7.21. Considering all the mass of the earth to be at its center of mass, 
which is the geometrical center of a sphere, then 


MMe 


re 


Mog = G (7.22) 


1,M> 
F= G-> 


7* 


ks 


ON Soot 

f Me “igs r i | ls 
Nags “ Y Aree, 
SOWA eee 


Henry Cavendish (1731-1810). 
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and 


2 
G= (7.23) 


The moon is also attracted to the earth by the gravitational force 


MmpMe 


2 
Tem 


F=G (7.24) i 


This force is the centripetal force, which from Eq. 7.20 is i) 


Ut 


FR = Mp 


Tem 
Substituting Eq. 7.24 for this force yields 


MpMe  MpVFT 
G+ = 
a i 


—— 


Torsion balance used by Cav- 
from which endish to determine the univer- 
sal gravitational constant G. 


G= OF Tem (7.25) 
Me 
Equate the G’s of Eqs. 7.23 and 7.25 
Te _ UT Tem 
She the 
from which 
& ~ en (7.26) 


He had the quantities r.,.. = 3.8 x 10®mandr, = 6.3 x 10° m measured by 
astronomers. vr; is the speed of the moon, which is the distance around its 
orbit 27r,,, divided by the period of rotation of the moon around the earth, 
27.3 days (2.36 x 10° sec) 


_ 2a X 3.8 X 108m 


= = 1.01 x 10° m/ 
or "2.36 X 10sec : — 
Substituting these numbers into Eq. 7.26 results in 


_ (1.01 x 10° m/sec)* (3.8 x 10°m) 
7 (6.3 X 10°m)? 


9.8 m/sec? 


And thus Newton was able to verify his gravitational law. 
Later, when Cavendish measured G, the mass of the earth and the sun 
could be calculated (see Example 7-5). 


PROBLEMS 
Example 7-5 
The radius of the earth is r, = 6.3 X 10°m and G = 6.67 X 107" Nm/?/kg?. 
Find the mass of the earth. 
Solution Use Eq. 7.22 
fim. 
thos = G3 


2 


12 8 

G 

_ (6.3 X 10°m)*(9.8 m/sec’) 
6.67 x 1071! Nm2/ke? 


Me = 


= 5.8 x 10% kg 


The same answer can be obtained if we use Eq. 7.19 instead of Eq. 7.22. 
Fr = mrw* 
Applying this to the motion of the moon around the earth, we write 


The rotational velocity must be in rad/sec. One orbit of the moon is 27 rad, 
which it completes in 27.3 days. Therefore, w = 27/27.3 day = 27/2.36 x 
10° sec = 2.66 X 10~® rad/sec. When this value is used the same answer is 
obtained for m,. 


2 nO 
G 


_ (3.8 x 10%m)°(2.66 x 107? rad/sec)’ 
- 6.67 x 10°" Nm/ke? 


= 5.8 x 10% kg 


me = 


PROBLEMS 


93 


7.1 Awheel of radius 0.5 m is rotating at 120 rev/min. 
(a) What is its rotational speed in rad/sec? (b) What 
is the tangential velocity of a point of the rim? (c) 
How many radians does the wheel turn in 10 sec? 
(d) If the wheel were rolling on the ground, what 
distance would it travel in 10 sec? 


7.9 Calculate the angular velocity of the hour hand, 
the minute hand, and the second hand of a wrist- 
watch. 


7.3 A wheel rotating at 5 rev/sec coasts to rest in 
30 sec. (a) What is its deceleration in rev/sec* and in 
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rad/sec’? (b) If the radius of the wheel is 0.4 m, what 
is the tangential acceleration of a point on the rim? 
(c) Through how many revolutions did the wheel 
turn in coming to rest? 
(Answer: (a) —0.167 rev/sec?, — 1.05 rad/sec?, 
(b) —0.42 m/sec?, (c) 75 rev.) 


7.4 A bicycle with a wheel radius of 0.34 m is trav- 
eling at 10 m/sec. What is the rotational speed of the 
wheels? 


7.5 A wheel rotating at 10 rev/sec makes 1000 rev 
while coasting to a stop with constant deceleration. 
How long did it take to stop? 


7.6 A wheel of radius 2 m starts rotating with con- 
stant angular acceleration a = 1.5 rad/sec*. What 
are the tangential and radial accelerations of a point 
on the rim after the wheel has rotated 207 rad? 


7.7 A pulley of radius r, = 8 cm is connected to 
the shaft of an electric motor. A belt couples the 
pulley to a wheel of radius r, = 24 cm (see Fig. 
7-7). The motor shaft begins to rotate with an angular 
acceleration a = 25 rad/sec’. (a) What is the angular 
velocity of the wheel after 3 sec? (b) Through what 
angle has the wheel rotated when the centripetal 
acceleration of a point on the rim of the wheel is 
100 9? (g = 9.8 m/sec’) 

(Answer: (a) 25 rad/sec, (b) 245 rad.) 


FIGURE 7-7 
Problem 7.7. 


7.8 A wheel makes 40 rev in:2 sec. The angular 
velocity at the end of the 2-sec time is 18 rev/sec. (a) 
What is the angular velocity at the beginning of the 
2 sec? (b)What is the angular acceleration (assume 
it to be constant) of the wheel? 


7.9 Assume that the orbit of the earth around the 
sun is circular and that the period of rotation is 365 
days. The earth-sun distance is 1.5 x 10!’ m. What 
is the centripetal acceleration of the earth resulting 
from its motion around the sun? 


7.10 A0.4-kg object on a string 0.5 m long attached 
to a pin on a frictionless table is made to rotate. If 


the breaking strength of the string is 20 N, what is 
the maximum rotational speed? 
(Answer: 10 rad/sec.) 


An object of mass 0.2 kg on a 0.4-m string is 
whirled in a vertical circle. (a) If the rotational speed 
is slowed until the object just completes the top of 
the circle with no tension in the string, what is its 
tangential velocity at that point? (b) If the same ve- 
locity is maintained at the bottom of the circle, what 
is the tension in the string at that point? See Fig. 


7-8. 
“T f | ™ i sie 
/ 7 \ f ‘ 
lin | 


FIGURE 7-8 T=0 \ Ir / 
Problem 7.11. i = a 
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7.12 A bug sits on a phonograph record 0.18 m 

from the center. If the record turns at 33 rev/min, 

what is the radial acceleration of the bug? If it has 

a mass of 0.5 gm, what is the centripetal force acting 
FIGURE 7-9 


on it? See Fig. 7-9. 
Problem 7.12. 


7.13 The earth-sun distance is 1.5 x 101! m. If the 
earth goes around the sun once in 365 days, find 
the mass of the sun. Assume the earth makes a cir- 
cular orbit around the sun. 

(Answer: 2 X 10°° kg.) 


7.14 The force of attraction between oppositely 
charged particles is given by Coulomb’s law, which 
has the same form as Newton’s gravitational law 


F= KLE 
r 


where g, and q2 are the charges on the particles in 
Coulombs (C), ris the distance between them, and 
K is a constant. In the Bohr model of the hydrogen 
atom, the electron revolves in a circular orbit around 
the stationary proton. The magnitude of the charge 
on the electron is the same as the charge on the 
proton q, = q2 = 1.6 xX 10°” Cand K = 9 x 10° 
Nm7/C?. The radius of the smallest electron orbit is 
5.3 X 107" m, and the mass of the electron is 9.1 


x 107°! kg. Find the number of rev/sec of the electron 
around the proton, according to the model. 
(Answer: 6.56 X 10’ rev/sec.) 


7.15 What should the duration of a day be in order 
for a person standing at the equator to have the 
feeling of weightlessness, namely, for the normal 
force exerted by the ground on the person to be zero. 
The radius of the earth is 6.37 x 10° m. 

(Answer: 1.41 h.) 


7.16 (a) What is the centripetal acceleration of a 
person standing on the earth at a point of latitude 
45°? (b) What is the magnitude and the direction of 
the force exerted by the ground on that person? Ex- 
press your answer in terms of the weight mg of the 
person. The radius of the earth is 6.37 x 10° m. 


7.17 A 2-kg block is rotating on a frictionless table 
with angular velocity w = 2 rev/sec. The block is 
connected to a 15-kg block by means of a string of 
total length 2 m that passes through a small hole in 
the table (see Fig. 7-10). How far below the tabletop 
does the 15-kg block hang? 

(Answer: 1.53 ~ 


FIGURE 7-10 
Problem 7.17. 


7.18 A1.5-kg mass is attached to one end of a rod 
of length ] = 1 m and negligible weight. The other 
end of the rod is pivoted, and the mass rotates in a 
vertical circle. The tangential velocity of the mass at 
the top of the circle is 3 m/sec. (a) What is the mag- 
nitude and the direction of the force exerted by the 
rod on the mass at the top of the circle? (b) If friction 
at the pivot is negligible, what is the tangential ve- 
locity at the bottom of the circle? (c) What force does 
the rod exert on the mass at the bottom? 

(Answer: (a) 1.20 N upward, (b) 6.94 m/sec, 

(c) 87 N.) 
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7.19 A particle of mass m = 0.7 kg is released from 
rest at point A in Fig. 7-11. It slides down and around 
the frictionless loop. (a) What are the radial and tan- 
gential accelerations at points B, C, and D? (b) What 
is the normal force exerted by the track at those three 
points? (This is the basis for a popular ride in amuse- 
ment parks.) 


AL 


FIGURE 7-11 
Problem 7.19. 


The length of the string of a conical pendulum 
is 0.6 m (see Fig. 7-12). The mass of the bob is 1.2 
kg. The angular velocity of the bob (which moves 
in a circle in the horizontal plane) is such that the 
angle between the string and the vertical is 30° and 
is constant. (a) What is the tension in the string? (b) 
What is the angular velocity of the bob? 
(Answer: (a) 13.6 N, (b) 4.34 rad/sec.) 


FIGURE 7-12 
Problem 7.20. 


7.21 A small particle of mass m is placed on top of 
a stationary, frictionless spherical ball of radius 
0.5 m (see Fig. 7-13). It is given a slight kick to start 
sliding down. (a) Find the tangential velocity of the 
particle when it looses contact with the sphere. (b) 
What is the angle 6 when contact is lost? 

(Answer: (a) 1.81 m/sec, (b) 48.2°.) 


FIGURE 7-13 
Problem 7.21. 
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8.1 INTRODUCTION 


In this chapter we introduce the concepts of rotational dynamics. Previously, 
we developed the first principles of linear dynamics; now we adapt the prin- 
ciples of linear dynamics to rotating bodies. The same laws apply, but their 
formulation is different. We will see, however, that Newton’s laws, momen- 
tum, energy, and power all have equations equivalent to their linear coun- 
terparts. 


8.2 MOMENT OF INERTIA AND TORQUE 


In Newton's second law, mass is the proportionality constant between force 
and acceleration. Newton called it the inertial mass, that is, the resistance of 
a body to having its state of motion changed. We encounter a similar concept 
in rotational motion. Independent of friction, it is easier to spin a bicycle 
wheel on its axle than it is to so spin a car wheel. This resistance to having 
the state of rotational motion changed is called the moment of inertia, with 
symbol I. To demonstrate it in its simplest form, let us consider the rotation 
of a point mass m at one end of a rigid massless rod of length r. Let the other 
end of the rod be fastened to a point of rotation so that the system can rotate 
in the plane of the paper, as in Fig. 8-1. Suppose a force F is applied to the 
mass in the direction shown. We construct cartesian coordinates with the 
origin at m and x’ as an extension of r. The force F has two components 
obtained by constructing the indicated lines perpendicular to the x’ and y’ 
coordinate axes. The x’ component Fr = F cos ¢ is in the direction of r. But 
because the rod is rigid, there can be no motion of the mass in the x’ direction. 
The component in the y’ direction is F sin @. It should be observed that by 
construction this component is tangent to the circle of rotation at the point 
where m is located, so Fr = F sin @. By Newton’s second law, this tangential 
force causes a tangential acceleration 


Fr = mar (8.1) 
From Eq. 7.7 
ar = ra (7.7) 
Substituting for Fr and a; in Eq. 8.1 yields 
F sin @ = mra 
Now multiply both sides of this equation by r 
rF sin @ = mr7a (8.2) 


From Fig. 8-1, r sin ¢ on the left side of Eq. 8.2 is equal to h, the perpendicular 
distance from the origin of the x-y coordinate system 0 to the line of F. Eq. 
8.2 can therefore be rewritten as 


U hy 
I 


Fh = mr’a (8.3) FIGURE 8-1 
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where 
Fh = Frsind 


The quantity Fh = Fr sin @, the product of a force times the perpendicular distance 
from the point of rotation to the line of the force, is called the torque produced by 
F, which is usually represented by 7, the small Greek letter tau. The quantity 
mr? on the right side of Eq. 8.2 is called the moment of inertia, I, of a point 
mass. We may therefore write Eq. 8.3 as 


7 = Ie (8.4) 


This is Newton’s second law, which governs rotation. When compared with 
F = ma, we see that 7 corresponds to force, I to mass, and a to linear accel- 
eration. Just as in the linear case, 7 must be the net torque and, if it is zero, 
there is no angular acceleration. Note that from the original definition, Eq. 
7.6, the units of a are rad/sec?. 

It should be pointed out that if the force in Fig. 8-1 did not lie in the 
x-y plane but in some other plane while making the same angle with r, the 
torque would still have the same magnitude, rF sin ¢, but the ensuing plane 
of rotation would not be the same. This ambiguity can be removed by as- 
signing a direction to 7. It is conventional to define 7 as the cross product of 
the position vector r and the force vector F, namely, 


T=rxF (8.5) 


From the definition of the cross product, Eq. 2.2, the magnitude of vis rF sin ¢, 
which is the same value assigned previously. Moreover, the direction of 7 is 
the perpendicular to the two vectors being crossed, r and F, according to the 
right-hand rule discussed in Chapter 2. In the case illustrated by Fig. 8-1, 7 
is perpendicular to the plane of the paper outward. Equation 8.5 defines + 
unambiguously. 


Example 8-1 

A balance scale consisting of a weightless rod has a mass of 0.1 kg on the 
right side 0.2 m from the pivot point. See Fig. 8-2. (a) How far from the pivot 
point on the left must 0.4 kg be placed so that a balance is achieved? (b) If 
the 0.4-kg mass is suddenly removed, what is the instantaneous rotational 
acceleration of the rod? (c) What is the instantaneous tangential acceleration 
of the 0.1-kg mass when the 0.4-kg mass is removed? 


Solution 
(a) When a balance is achieved a = 0 and therefore 
xT = 0 


On the right of the pivot point the force is m,;g downward and the 
cross product r X F is into the paper or negative. On the left the 


FIGURE 8-2 
Example 3-1. 
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force is m2g downward and the cross product r X F is out of the 
paper or positive. 
(1m2)(x)sin 90° — (mg)(0.2 m)sin 90° = 0 
Solving for x 
(™m1g)(0.2 m) sin 90° 
a ee 
(12g) sin 90° 


_ (0.1 kg)(9.8 m/sec2)(0.2 m) 
7 (0.4 kg)(9.8 m/sec?) 


= 0.05 m 


(b) + = la 
(mg)(0.2 m) sin 90° 
(™m)(0.2 m)? 


2 = (0.1 kg)(9.8 m/sec’)(0.2 m) sin 90° 
(0.1 kg)(0.2 m)? 


wate 
=i - 


a = 49 rad/sec? clockwise 


(c) ar = ra 
(0.2 m)(49 rad/sec?) 


= 9.8 m/sec? 


As expected, the answer to part (c) is the acceleration of a body in free fall. 
The same answer will be obtained for the left-hand weight if the right-hand 
one is removed except that the rod will rotate counterclockwise. 


In Eq. 8.3, mr? is the moment of inertia of a point mass at a distance r 
from the pivot point. If there are a variety of masses at different distances 


from the pivot point, the moment of inertia of the assembly is the sum of 
their individual ones or 


I= > mr - (8.6) 


If all the masses are at the same distance r from the pivot point, r? = r? for 
all the terms in the sum and r? can be factored to obtain 


I=r? Dd m, (8.7) 


If we wish to find the moment of inertia of a thin hoop such as a bicycle 
wheel with essentially massless spokes, then the mass of the wheel M is 


KH 
l= 3 mr 


p= | 


simply 
M = >» Mm; 
i=1 


where m; is the mass of each infinitesimal element. Therefore, the moment 
of inertia of a bicycle wheel is approximately 


I = Mr? 


The value of I for spheres, cylinders and such must be either derived by 
Eq. 8.6 or looked up in tables. We should note that unlike the translational 
inertia (the mass), the rotational inertia (moment of inertia) of an object de- 
pends on the location of the mass relative to the axis of rotation and in general 
is different for different axes of rotation (see problems 8.5 and 8.6). 


8.3 ROTATIONAL KINETIC ENERGY 


In Fig. 8-1 the force F was divided into two orthogonal components. It is seen 
that because r is fixed, the component of F in the x’ direction can do no work. 

In an infinitesimally small time interval dt, the tangential component of 
F, F; = F sin ¢ causes the particle to move an infinitesimal displacement ds, 
which from Eq. 7.2 is given by 


ds = vr dt (8.8) 


Because time is a scalar quantity, the direction of ds is the same as that of vr, 
namely, tangent to the path of the particle and therefore in the same direction 
as Fr (see Fig. 8-3). The work done by Fr in this infinitesimal distance is dW, 
and by the definition of Eq. 5.3 is 


dW = Fr-ds (8.9) 


But because Fy and ds are in the same direction, the dot product in Eq. 8.9 
can be deleted. 


dW = Frds (8.10) 


The tangential displacement of the particle is accompanied by an increase in 
the angle 6 (see Fig. 8-3). The two are related by the differential form of Eq. 
7.1 


do = — (7.1’) 


Substituting F sin ¢ for Fr and r dé for ds into Eq. 8.10, we obtain 
| dW, = Fsin fr dé (8.11) 


We have added the subscript @ to dW to indicate that the work is associated 
with an angular displacement dé. 


ROTATIONAL KINETIC ENERGY 


es Fal 


FIGURE 8-3 
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The product F sin @ r in the right side of Eq. 8.11 may be recognized as 
the torque exerted by the force F (see Eq. 8.5), therefore 


dW, = 7 dé (8.12) 


To find the work done for a finite rotation we simply integrate Eq. 8.12 


0 
W, = [740 (8.13) 


0 


where 6 and @ are the initial and final angles, respectively. Equation 8.13 is 
equivalent to the expression found in Chapter 5 for the work done in trans- 
lation 


W = [Fax (5.6) 


In Chapter 5 we indicated that the net work done on a body changes its 
velocity, or more precisely, its kinetic energy. This was known as the work- 
energy theorem (Eq. 5.9). We can show that the same occurs in the case of 
rotation, while at the same time we will find an expression for the kinetic 
energy of a rotating body in terms of rotational parameters. 

Substituting + = Ia (Eq. 8.4) into Eq. 8.13 yields 


0 
W, = J ‘Ta dO (8.14) 


But, by definition, a = = and d@ = w dt; therefore, 


% da 
Wo = [1S wd 
The time dt cancels out in the integral, and we get 


, oy 

We = Iw dw (8.15) 
where we have changed the limits from 9 and 6; to wo (initial angular velocity) 
and a; (final angular velocity), for we now integrate with respect to w. If the 
moment of inertia is constant—that is, if the distance of the particle to the 
point of rotation does not change—then Eq. 8.15 becomes 


W,=I1 ae 


Wo 


1 1 
5 lo? — 5 Iai (8.16) 


= 
I 


w, = [" rao 
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Comparing this with the work-energy theorem of Chapter 5, where we saw 
that work done is equal to the change in kinetic energy, we may write 


1 1 
(AEx)rot = 5 lof = 5 lwo (8.17) 


1 ee 
where the quantity 5 Iw? is called the rotational kinetic energy, (Ex)rot- 


We may gain insight into the physical significance of (Ex),ot = 1/2 Iw? by 
relating it to the linear kinetic energy of the particle. A point on a rotating 
system has an instantaneous tangential velocity vr. Its kinetic energy is there- 
fore 


1 
— 5 NUT 


But, because vr = rw, the kinetic energy may be written as 


1 
Ex = 5 ros" 


The moment of inertia of a point mass is I = mr?. Therefore, 
1 
Ex aa 5 lot 


Hence, the expression for the rotational kinetic energy in terms of I and @ is 
simply another form of the kinetic energy of a particle rotating about a fixed 
axis. 

The expression 3/7 for the rotational kinetic energy can be readily shown 
to be applicable to the rotation of a rigid body made up of discrete masses 
m;. The rotational kinetic energy of the ith particle is 


1 
(E,)rot Of ith particle = 5 m;r7w? (8.18) 
and the (E;),o. of the body is the sum of (E;),ot of the individual masses 
1 n 
(Ex)rot = = = Mir?w? 
Zi 


Because the body is rigid, all point masses rotate with the same angular 
velocity regardless of their distance from the axis, so #7 = w* and it can be 
factored out of the sum. We then have 


1 "1 
(Ex) rot =-w > mit? 
2 i=1 


But the quantity in the summation is the definition of the moment of inertia 
I for a system of particles (see Eq. 8.6) and therefore 


1 
(Ex)rot = 5 lot (8.19) 


(Eyrot = 5 10? 
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A body can be rotating as it translates through space; for example, the earth 
rotates about its axis as its center of mass moves about the sun. Clearly, the 
earth’s rotation gives it more kinetic energy than if it were moving without 
rotation. Its total kinetic energy is therefore the sum of translational and 
rotational kinetic energies 


1 1 
(Ex)total ae (Ex)trans + (Ex)rot = 5 MVM + 5 lot 


where Ucy is the translational velocity of the center of mass. 


Example 8-2 


A large wheel of radius 0.4 m and moment of inertia 1.2 kg-m’, pivoted at 
the center, is free to rotate without friction. A rope is wound around it and 
a 2-kg weight is attached to the rope (see Fig. 8-4). When the weight has 
descended 1.5 m from its starting position (a) what is its downward velocity? 
(b) what is the rotational velocity of the wheel? 


Solution 

(a) We may solve this problem by the conservation of energy, equating 
the initial potential energy of the weight to its conversion to kinetic 
energy of the weight and of the wheel. 


1 1 
mgh = 5 mv + 5 Iw” 


The downward velocity v of the weight is equal to the tangential 
velocity at the rim of the wheel v7; therefore 


Substituting for w 
1 1_v? 
mgh = 5 mv + 515 


We solve for the velocity v 


1/2 


_ mgh 
on ees 
2 2r? 
1/2 
_ | (2kg)(9.8 m/sec’)(1.5 m) 
ae (1.2 ke-m?) 
(5) 28) + Q)0.4 mp 
v = 2.5 m/sec 


(E, Jrotal = 


2 


FIGURE 8-4 
Example 8-2. 


“r | 
mvcm + 5 la 


(b) The answer to part (a) shows that any point on the rim of the wheel 
has a tangential velocity of vr = 2.5 m/sec. We convert this to rota- 
tional velocity of the wheel 


oO = ae ae = 6.2 rad/sec 


8.4 POWER 


The definition of power is work done per unit time. The incremental amount 
of work done in moving the mass in Fig. 8-3 a distance ds = r d@ is given in 
Eq. 8.12. 


dW, = 7 d6 (8.12) 


But, from Eq. 5.15 
Power = — 
Substitute Eq. 8.12 to obtain 
7 dé 
Power = “dt 
or, because w = dO6/dt, 
| Power = Tw (8.20) 


Example 8-3 


A machine shop has a lathe wheel of 40-cm diameter driven by a belt that 
goes around the rim. If the linear speed of the belt is 2 m/sec and the wheel 
requires a tangential force of 50 N to turn it, how much power is required to 
operate the lathe? 


Solution Use Eq. 8.20 


Power = Tw 


If there is no slipping between the belt and the wheel, the linear speed of 
the belt is equal to the tangential velocity at the rim of the wheel and the 
rotational velocity is therefore 


10 rad/sec 


POWER 


Power = Tw 
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From Eq. 8.5 the torque is 
7=rFsingd 
= (0.2 m)(50 N) sin 90° 
= 10 Nm 


then 
Power = 10 Nm x 10 rad/sec 


1 hp 
100 W €: F) = 0.13 hp 


8.5 ANGULAR MOMENTUM 
We learned that an important property of a particle or of a system of particles 
(a body) is its momentum p = mv. An equivalent property can be associated 
with a rotating body. 
Consider, as shown in Fig. 8-5, a particle of mass m with momentum 
p = mv in the x-y plane. The position vector of m is r, which is not required 
to be a constant. 
From Newton’s second law we write 
d 
F=— a oe) (4.2) 
If we take the cross product of both sides with the position vector r, we obtain 
d 
rX F=r xX — (mv) (8.21) 
dt 
By definition r x F = 7 and therefore 
at 
T= rx — a wv) (8.22) 
The right side of Eq. 8.22 can be rewritten as 
d 
rx — a ww) = “( x mv) (8.23) 


The equivalence of the two expressions becomes evident if we differentiate 
the second term 


— (r X mv) = ox my +X < (mv) 
dt dt 
But by definition dr/dt = v, the instantaneous velocity of the particle; therefore 


—(r X = vx ae ae Gay) 
a mv) =vxm it 


FIGURE 8-5 


CONSERVATION OF ANGULAR MOMENTUM 


We saw in Chapter 2 that the cross product of two vectors in the same direction 
is zero. Therefore 
vX mv = m(v X v) = 0 


and, therefore 


d d 
Fra xX mv) =r xX a Wmv) 
ar d 
Substituting Eq. 8.23 for r x rr (mv) in Eq. 8.22 yields 


2 x 8.24 
T= mv) (8.24) 


We have already indicated that the torque 7 in rotational motion plays the 
role of the force F in translational motion. Thus, if we compare Eq. 8.24 with 
Eq. 4.2, we are led to the conclusion that the quantity r x mv in rotational 
motion plays the same role as does the momentum mv in translation. We 


therefore call 
L=rX mv (8.25) 


the angular momentum of the particle. 
We can find another expression for L that shows even more clearly its 
correspondence to the momentum mv. 


L=rxXmv=rmvsiny 


where y is the angle between the radius vector r and the linear momentum 
mv (see Fig. 8-5). But, from Fig. 8-5, mv sin y = mvr, and 


L = rmvr 
From Eq. 7.4, vr = rw and therefore 
L = mr*o (8.26) 
We have defined mr? as the moment of inertia I of a point mass; hence 
L = Iw (8.27) 
The equation of motion for rotation, Eq. 8.24, can be written 


dL, _ dle) 


OFT dt (8.28) 


8.6 CONSERVATION OF ANGULAR MOMENTUM 


We will now show that the law of conservation of momentum that was derived 
in Chapter 6 applies equally to angular momentum. Start with Eq. 8.28 
_ d(Iw) — ab 


T= —= 


dt dt 


d . 
— rT (r X mv) 


L=rxX mv 


L, = Tea 
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If we have a situation in which there is no net externally applied torque, then 
7 = 0. Thus 
dL _ 
dt 
and L = constant. Hence, Iw = constant. 
Therefore, with no net external torque 
(lw)o = (lw), (8.29) 


This is known as the law of conservation of angular momentum. 


Example 8-4 


Suppose the body of an ice skater has a moment of inertia | = 4 kg-m* and 
her arms have a mass of 5 kg each with the center of mass at 0.4 m from her 
body. She starts to turn at 0.5 rev/sec on the point of her skate with her arms 
outstretched. She then pulls her arms inward so that their center of mass is 
at the axis of her body, r = 0. What will be her speed of rotation? 


Solution 
[p@p = [pars 
(body + Tamms) @o = Iboay MF 
(Iboay + 2 mr?) Wo = Tpoay y 
Solving for a, 


(Ibcay + 2 mr?) wy | [4 kg-m? + 2 x 5 kg x (0.4 m)’] (0.5 rev/sec) 
OF = sO TT OO 0 


(lw)) = (Jw), 


— 


Due to the small torque exerted 
by the ice, the angular momen- 
tum of a spinning skater is al- 
most constant. As a result, when 
the skater pulls her arms inward, 


lboa 4 kg-m? thus reducing its moment of in- 
» ertia, her angular velocity in- 
= 0.7 rev/sec creases. 
PROBLEMS 
8.1 A bicycle wheel of mass 2 kg and radius celeration of the system at the instant shown in Fig. 
0.32 m is spinning freely on its axle at 2 rev/sec. 8-6? 


When you place your hand against the tire the wheel 
decelerates uniformly and comes to a stop in 8 sec. 
What was the torque of your hand against the wheel? 


8.2 Two masses, m, = 1 kg and m. = 5 kg, are 

connected by a rigid rod of negligible weight (see 

Fig. 8-6). The system is pivoted about point 0. The 

gravitational forces act in the negative z direction. 

(a) Express the position vectors and the forces on 

the masses in terms of unit vectors and calculate the FIGURE 8-6 
torque on the system. (b) What is the angular ac- Problem 8.2. . 


8.3 A roulette wheel with I = 0.5 kg-m? rotating 
initially at 2 rev/sec coasts to a stop from the constant 
friction torque of the bearing. If the torque is 0.4 
N-m, how long does it take to stop? 


(Answer: 15.7 sec.) 


8.4 A grindstone with I = 240 kg-m? rotates with 
a speed of 1 rev/sec. A knife blade is pressed against 
it, and the wheel coasts to a stop with constant de- 
celeration in 12 sec. What torque did the knife exert 
on the wheel? 


8.5 Four identical masses (m = 2 kg) are connected 
by rods of negligible weight to form a rectangle (see 
Fig. 8-7). The masses are rotated about an axis per- 
pendicular to the plane of the rectangle and passing 
through its center with an angular acceleration a = 
3 rev/sec*. What torque is needed? 

(Answer: 1282 N-m.) 


FIGURE 8-7 
Problem 8.5. 


8.6 Repeat problem 8.5 for a rotation, with the same 
angular acceleration, about an axis through a corner 
of the rectangle (see Fig. 8-8). 


FIGURE 8-8 
Problem 8.6. x 


8.7 A uniform wooden board of mass 20 kg rests 
on two supports as shown in Fig. 8-9. A 30-kg steel 
block is placed to the right of support A. How far 
to the right of A can the steel block be placed without 


aa ? 
tipping the board’ (Answer: 2.0 m.) 
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FIGURE 8-9 5 
Problem 8.7. B A 


8.8 A wheel (I = 30 kg-m’) is pivoted about an axis 
through its center. A 90 N-m torque is applied to 
the wheel, which then accelerates from rest to an 
angular velocity of 20 rad/sec in 10 sec. (a) What is 
the friction torque of the bearings? (b) If the applied 
torque is removed after 60 sec, how long will it take 
for the wheel to come to rest? 


8.9 Calculate the change in rotational kinetic en- 
ergy of the roulette wheel and grindstone of prob- 
lems 8-3 and 8-4. 


8.10 A ball of mass 0.3 kg and radius 0.1 m rolls 
along the ground with a transverse speed of 4 m/sec. 
It comes to a slope inclined at 30°. How far up the 
slope does it roll? (I (ball) = 2/5 mr?). 

(Answer: 2.29 m.) 


8.11 A wheel of moment of inertia 60 kg-m? and 
radius 1.5 m is rotating about an axis through its 
center with an angular velocity w = 30 rev/sec. A 
brake is applied producing a normal force of 450 N 
against the rim (see Fig. 8-10). The coefficient of 
friction between the wheel and the brake is 0.5. 
(a) How long will it take for the wheel to stop? (b) 
Calculate the work done by friction, and show that 
this is equal to the change in the kinetic energy of 
the wheel. 


FIGURE 8-10 
Problem 8.11. 


8.12 Although most people are concerned about 
the horsepower of a car’s motor, the important pa- 
rameter is the amount of torque that can be given 
to the rear wheels. The torque of the motor is turned 
at right angles to the wheels by the differential gear. 
Assume that in low gear the angular velocity of the 
rear wheels is 0.1 that of the motor. If the motor has 
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200 hp and is turning over at a rate of 1400 rev/min, 
how much torque is delivered to the rear wheels? 
(Answer: 1.02 x 10* N-m.) 


8.13 A 4-kg block is attached to one end of a light 
rope. The other end of the rope is wrapped around 
a pulley of moment of inertia I = 0.5 kg-m* and 
radius r = 0.2 m (see Fig. 8-11). The block is released 
from rest, and it moves down 9 m in 3 sec. (a) What 
is the friction torque of the bearings? (b) Use energy 
principles to calculate the velocity of the block after 
it has fallen 9 m. 

(Answer: (a) 1.24 N-m, (b) 6.0 m/sec.) 


4 ws 
hs a's 


FIGURE 8-11 


Problem 8.13. adh 


8.14 A 2-kg block resting on a frictionless table is 
connected by a string passing over a pulley to a 
second block, m, = 5 kg, hanging over the edge of 
the table 0.8 m above the floor (see Fig. 8-12). The 
moment of inertia of the pulley is 0.8 kg-m? and the 
radius is 0.1 m. Neglect the friction of the bearings 
and assume that there is no slipping between the 
string and the pulley. Use energy methods to cal- 
culate the velocity of m2 as it hits the floor. 

(Answer: 0.95 m/sec.) 


é kg 


FIGURE 8-12 
Problem 8.14. 


tion between the 2-kg block and the table is 0.25. 


8.16 Use the data in problem 7.14 to find the an- 
gular momentum of an electron in the smallest orbit 
of the hydrogen atom. 

(Answer: 1.05 x 10~* J-sec.) 


8.17 A50-gm mouse falls onto the outer edge of a 
phonograph turntable of radius 20 cm rotating at 33 
rev/min. How much work must it do to walk into 
the center post? Assume that the angular velocity of 
the turntable does not change. 


8.18 A children’s merry-go-round of radius 4 m 
and mass 100 kg has an 80-kg man standing at the 
rim. The merry-go-round coasts on a frictionless 
bearing at 0.2 rev/sec. The man walks inward 2 m 
toward the center. What is the new rotational speed 
of the merry-go-round? What is the source of this 
energy? (The moment of inertia of a solid disk is 
I = 1/2 mr’). 

(Answer: 0.37 rev/sec.) 


8.19 A mass of 0.1 kg on a String is rotating on a 
frictionless table with w = 1 rev/sec and r = 0.2 m. 
The string passes through a hole in the table and is 
held by a hand below (see Fig. 8-13). (a) What is the 
angular momentum of the mass? (b) What is the 
kinetic energy of the mass? (c) If the string is pulled 
down by the hand until r = 0.1 m, what is the new 
rotational speed of the mass? (d) What is the new 
kinetic energy? (e) How much work did the hand 
do in pulling the string? 

(Answer: (a) 2.51 x 10~? J-sec, (b) 7.9 x 10°7J, 

(c) 4 rev/sec, (d) 3.16 X 1071 J, (e) 2.37 x 107! J 


FIGURE 8-13 
Problem 8.19. 


Kinetic Theory of 
Gases and the 
Concept of | 
Temperature 


112 KINETIC THEORY OF GASES AND THE CONCEPT OF TEMPERATURE 


9.1 INTRODUCTION 


Heat and temperature proved to be very elusive concepts to early scientists. 
Most historic theories—for examples, the phlogiston and the caloric—as- 
sumed that heat was a substance that could flow, much as a gas or a fluid. 
In fact, the mathematics of heat flow were correctly worked out before sci- 
entists learned the true nature of heat and its associated property, tempera- 
ture. For indeed “heat’”’ does flow; but what is heat? 

Our modern understanding of heat, temperature, and the behavior of 
gases is the result of two and a half centuries of scientific investigation; we 
now know that heat is a form of energy. We will not tell the entire story, 
because it is beyond the scope of this book. We will, however, trace the story 
through the measurement of temperature, the ideal gas law, and then the 
application of the first principles of mechanics, as developed in the earlier 
chapters, to the average motion of molecules in gases. The identification of 
the average kinetic energy of molecules with temperature will then be shown. 
From that we will be able to write the first law of thermodynamics, which is 
a broadened statement of the law of conservation of energy. 


9.2 MOLECULAR WEIGHT 


In real systems there are vast numbers of atoms, all of which obey the first 
principles of physics, or quantum variations of them, in their motion. The 
motion of each is different however, so a conclusion about the behavior of a 
group of atoms is statistical. In this book we will consider only systems 
composed of identical atoms, or molecules such as oxygen molecules (O2) or 
nitrogen molecules (Np). 

Ensembles of different atoms or molecules have different statistical av- 
erages of their properties. This is because if objects that have the same kinetic 
energies, mv’, have different masses, then their velocities are different. It is 
therefore important to know the mass of the atoms or molecules that make 
up the ensemble. It is clear that if we know the mass of the ensemble and 
the weight of each particle of the ensemble, then we can immediately deter- 
mine the number of particles. 

We use a unit called the mole (abbr. mol) as a measure of the number of 
particles with the following definition. A mole of a substance is that quantity 
which contains the same number of particles as there are atoms in 12 g (12 X 
10~° kg) of carbon-12. The measure in SI units is the kilomole (kmol), which is 
the quantity of the substance that contains the same number of particles as 
there are atoms in 12 kg of carbon-12. All the elements have isotopes, that is, 
atoms with the same chemical properties but with a slightly different mass. 
Therefore, a single isotope of carbon (carbon-12) is chosen as the reference 
standard. This standard is said to have a mass of exactly 12 u per atom, where 
u is called an atomic mass unit and has the value 


lu = 1.66057 x 10-2’ kg 


Amedo Avogadro (1776-1856). 


The mass of an atom (or molecule) in atomic mass units is called the atomic 
weight (or molecular weight). Thus, for example, the atomic weight of carbon- 
12 is 12 u, that of hydrogen-1 is 1.0078 u. The mass, in grams, of a mole of 
a substance is numerically equal to the atomic weight (or molecular weight) 
of the atoms of that substance, and it is referred to as the gram atomic weight 
(or gram molecular weight). The mass of 1 mole of carbon-12 is 12 g/mole, that 
of hydrogen-1 is 1.0078 g/mole. Often, the word “gram” is deleted from the 
expressions for the mass of a mole, which may lead to confusion. We can 
rely on the units to see whether we are dealing with the mass of an atom or 
that of a mole. 

By the use of very careful techniques, chemists and physicists have been 
able to measure the number of atoms in 1 mole of a substance. This is called 
Avogadro’s number and has the value 


Na, = 6.022 x 1073 atoms/mol = 6.022 x 107° atoms/kmol 


Therefore, if we know the number of moles of substance present in the 
ensemble, we can calculate the number of atoms or molecules present. 

We will use the symbol n to represent the number of moles present, 
where n may be greater or less than one, or equal to it. Because one usually 
has less than 1 mole, n has the name mole fraction. If we denote M as the 
gram molecular weight of a substance and m as the mass of the amount 
present, then n = m/M and, because M has Na atoms or molecules, the 
number of atoms or molecules present is nNa. 


9.3 THERMOMETERS 


It has been known from ancient times that solids and liquids expand when 
they are heated. It is not known when the first thermometers were made, 
but they are believed to have been brought into general use by Duke Ferdinand 
of Tuscany in 1654. They were generally used shortly thereafter by members 
of the Academy of Science of Florence (which was founded by him) and were 
long known as Florentine thermometers. They were much like modern ther- 
mometers in that they had'a colored liquid, presumably alcohol, hermetically 
sealed in a tube with a bulb at one end, with little pieces of colored glass to 
mark even divisions on the scale. | 

In 1714 Gabriel Fahrenheit proposed that a scale be established in which 
the temperature of the human body be taken as 100° (which has since been 
corrected to 98.6°) and 0° be the lowest temperature attainable with a mixture 
of ice and salt, sodium chloride (NaCl). Using this scale, the melting point 
of pure ice is 32° and the boiling point of pure water at sea level is 212°. 
Shortly after Fahrenheit’s death in 1736 a different scale, Centigrade or Cel- 
sius, came into use; by this scale, the melting point of ice was taken at 0°C 
and the boiling point of water at 100°C. 

The Fahrenheit scale is still popularly used in the United States and 
Canada, probably because of its finer divisions for meteorologic measure- 
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One of the earliest thermometers 
used, was Galileo’s thermoscope 
shown here. 
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ments, but in scientific laboratories and in most of the world the Celsius scale 
is used. The conversion between the two has ever since been confusing to 
the layperson, but, with a little thought, one can eliminate the difficulty. The 
conversion is easy to see if we construct yet a third scale, which we will call 
the °F-32 scale, in Fig. 9-1. If we take an arbitrary temperature point on the 
°C scale and the same point on the °F-32 scale, we may make a ratio between 
the temperature of °C and °F-32 scales as 


x & 100 
°F-32 180 
or 
a & 5 
-32 9 
Thus 
°C = : (°F - 32) (9.1) 
or 
i = °C + 32 (9.2) 


9.4 IDEAL GAS LAW AND ABSOLUTE TEMPERATURE 


We define the term pressure, P, as force perpendicular to a surface per unit 
surface area, or P = F/A. Therefore, for a given force, the smaller the area 
on which it acts the larger is the pressure. For example, a weight terminating 
in a sharp point can usually make an indentation in a surface on which the 
point rests. But if the point is changed to a flat face of larger area, no mark 
will be made. 

The dimension of pressure is newton/meter* (N/m/?), and in fluids we 
sometimes use the term pascal (Pa) where 1 Pa = 1 N/m*. Atmospheric 
pressure at sea level is approximately 1.01 x 10° N/m’, which is equivalent 
in the English system to 14.7 lb/in’. 

An ideal gas is one that has no tendency to condense. This means that 
the atoms are infinitesimal in size and that there is no attractive force between 
them. Ideal gases do not exist, but they may be approximated by rare gases 
(such as helium, neon, argon) at low pressure, or any other gas at very low 
pressure. Robert Boyle in 1662 showed that if the quantity of gas and its 
temperature remain constant, then the pressure and volume vary inversely. 


p=— (9.3) 


where C is a constant. In 1802, Joseph Gay-Lussac showed that if the quantity 
of gas and its volume remained constant, the pressure is proportional to the 


Water boils r , 


ice melts 
C F F=—32 
scale scale scale 
FIGURE 9-1 


(e) 2 fe 
C = 5 (F-32) 
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temperature or 
P = KT (9.4) 


where K is a constant. 
These two laws may be combined into a single law with a single constant: 


PV = R'T (9.5) 


When equal volumes of the same gas are taken at the same temperature and 
pressure, R' remains constant. If, however, equal masses of different gases 
are taken under the same conditions, R’ varies inversely with the molecular 
weight. 

In the middle of the last century low temperatures were achieved in Lord 
Kelvin’s laboratory in England. The ideal gas law, Eq. 9.5, was examined over 
an extended range of temperatures. It seemed desirable to establish R' as a 
constant and to correct its inverse variation with the mass of the gas used by 
multiplying by the number of moles, n (because n = m/M). With the intro- 
duction of this term n the ideal gas law is writen as 


PV = nRT (9.6) 


where n is the number of moles (mole fraction) and R is now the same for 
all gases. 

When Kelvin’s group examined Eq. 9.6 at constant volume for different 
amounts of a gas 1, Nz, Nz, . . ., the data appeared as in Fig. 9-2. It is seen 
in this graph that the data taken to the lowest achievable temperature T7, all 
lie on straight lines and, if these lines are extrapolated to P = 0, they terminate 
ata common point, —273.16° C. This was called absolute zero, and is the lowest 
possible temperature. It was therefore logical to establish a new temperature 
scale with its zero point at — 273.16° C. Thus, 0°C = +273.16 K, where K is 
the symbol for the new scale, called the Kelvin or absolute scale. It is related 
to the Celsius scale as 


K = 273.16° + °C (9.7) 


With data of the type shown in Fig. 9-2, the gas constant R was evaluated 
by Kelvin and his associates as 


R = 8314 J/kmol-K 


Example 9-1 


What is the temperature of absolute zero on the Fahrenheit scale? 
Solution Absolute zero = —273.16° C. From Eq. 9.2 


9 
°F = =°C + 32 
ac 


9 
F = = X (273.16) + 32 = —459.7° F 


PY = nkT 


FIGURE 9-2 

Results of Lord Kelvin’s experi- 
ments showing a linear variation 
of the pressure of a gas with tem- 
perature when the volume is kept 
constant. The three curves cor- 
respond to different amounts of 
gas. When the curves are extra- 
polated, they terminate at a com- 
mon point T = —273.16°C. 
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Example 9-2 


In a typical experiment to determine the value of the gas constant R, 0.152 g 
of neon gas (atomic weight 20.2 g/mole) is introduced into a 100-cm? flask 
that is closed and attached to a pressure gauge. It is found that when the 
flask is placed in a constant temperature bath at 50° C the pressure of the gas 
is 2 atmospheres (atm). What value of R is obtained? 


Solution The mole fraction n is the ratio of the number of grams present 
to the atomic weight in grams. 
0.152 g 


eo oD 10-2 mol = 752 & 10 kano 
" ~ 30.2 g/mole poet ine is 


The volume is 


William Thompson Kelvin (1824— 


1m \° 
— 102 om3 ( 2) = 10-4 m3 
V = 10° cm (5 = 10°4m 1907). 


The pressure is 


1.01 x 10° N/m? 


= 2 atm ( 
1 atm 


; = 2.02 x 10° N/m? 


The temperature in K is T = 273° + 50° = 323 K. 
The ideal gas law PV = nKT is written as 


_ 2.02 x 10° N/m? xX 1074 m3 
~ 752 x 10-&kmol x 323K 


= 8316 J/kmol-K 


Example 9-3 


In a diesel engine, no spark plug is required because the temperature is raised 
to the ignition point of the air-fuel mixture by compression. In a typical diesel 
engine the air intake is at 27° C and at a pressure of 1 atm, and it is compressed 
to 1/15 of its original volume with its pressure becoming about 50 atm. What 
is the temperature of the air-fuel mixture in the cylinder in °C? 


Solution We note from the ideal gas law that 
PV _ 

a= 

and, if the quantity of gas is kept constant, then the right side of the equation 


is a constant. Therefore, if we change any of the quantities on the left-side, 
the other quantities must change to yield the same constant, nR. This means 


nR 
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that the initial conditions of the left side must equal the final conditions 
because both the initial and the final conditions are equal to the same constant, 
nR. We write this as 


PoVo _ PMs 
To ie 
where T must be in K. 
In this problem 
P, V 
T, = ST, 
Ee Pave” 


50 atm V,/15 m3 
latm V,m? 


1000 K = 727°C 


x 300 K 


9.5 KINETIC THEORY OF GAS PRESSURE 


We will now show how the concept of momentum conservation and the 
definition of pressure can be used to calculate the statistical behavior of a 
large number of atoms or molecules in a gas. One of the assumptions in this 
calculation is that all collisions between atoms or molecules are perfectly 
elastic. This is not strictly true at high temperatures, because in some high- 
energy collisions electrons are excited or even knocked off atoms. Although 
this situation can be dealt with theoretically, it will not concern us here. 

Because the walls of a container are also made of atoms, then all collisions 
between the atoms of a gas in a container and the walls are elastic. One other 
fact must be kept in mind. In an elastic collision of an atom with the container 
wall, the velocity component normal to the wall is reversed on collision with 
its magnitude unchanged, and the velocity component in the direction parallel 
to the surface of the wall is unchanged. This can be seen in the two-dimen- , r 
sional schematic of Fig. 9-3. Viewed from above, it is clear that a v, velocity 
component would also be unchanged. 


We recall from Chapter 4 that an impulse acting on a body is equal to Nese | sess eee 
the change in the momentum of the body. For the situation of Fig. 9-3, 3 a 
F, At = A (mo,) " ae 
where F, is the average force exerted by the wall of the container on the atom 7 : 
during the time interval At, and where m is the mass of the atom. From . My 
Newton’s third law of action and reaction, the magnitude of the force exerted = FIGURE 9-3 
by the atom on the wall is equal to F, and can be written as Two-dimensional representation 
of an elastic collision of a mole- 
— MOx final ~ 0x initial cule with the wall of the con- 


: At tainer. 
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Because only the direction and not the magnitude of v, changes on collision 
LE, = 2 (9.8) 


where the choice of sign depends on the assignment of velocity direction 
sign. 

Suppose the atom is moving about in a cubical box of side length I, area 
of a face A = Ff and a volume V = PF (see Fig. 9-4). The direction of the force 
in the impulse of Eq. 9.8 and the initial velocity are reversed if the atom 
collides with the opposite wall. Thus, it will be convenient to consider only 
the magnitude of the average force, so we will drop the negative sign in Eq. 
9.8. We may approximate At of Eq. 9.8 as the time between collisions of the 
atom against the wall. This is the time for the atom to travel to the opposite 
wall, bounce off it, and return to the first wall. 

Because the atom’s velocity in the x direction remains constant in mag- 
nitude, the time for a round trip between opposite walls is 


s2 
== 


At (9.9) 
Substituting this into Eq. 9.8 obtains the magnitude of the average force on 
a wall due to the successive striking by one atom 
= 2mv mv 
SS 9.10 
Ox 


This is the average force on a wall in the y-z plane due to a single atom of 
the gas. Let us call the force due to this atom F,, and the x velocity 0,1. Then 
if there is a second atom with velocity v,2, it would contribute a force F,. , 
and so on. The total average force on a wall due to the x motion of N atoms 
in the box would be the sum of the contribution of each, or 


m 
F, = — 0%; + 


mM Mm 
2 
— Usy + "1 Oxy 


l l 


7 (Oh + te + +++ vy) (9.11) 


If N is the total number of atoms in the box, then by the definition of an 
average as the sum of the individual amounts divided by the number of items, 
we may write for v2, the average of the squared individual x velocities, 


2 e es e 2 
_) U21 te O7x2 a3 OxN 


= 9.12 
ve 7 (9.12) 


Substitute Eq. 9.12 into Eq. 9.11 


EF, = —v (9.13) 


FIGURE 9-4 
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By the three-dimensional pythagorean theorem 

C=]o, + ot U 
or, expressed in averages, 

P=wetet+ oe (9.14) 
But in a gas in equilibrium there is no preferred direction of motion; hence 


2 Sil, aie 
Ox = Vy 


g 
Se 


Therefore, Eq. 9.14 may be written as 
v = 30? | (9.15) 


Substitute v2 from Eq. 9.15 into Eq. 9.13, and F, becomes a general force on 
any wall F 


mN 


-F v" (9.16) 


F= 


If we now use the definition of pressure P = F/A, we may write Eq. 9.16 as 


_ mN-s> 
P= 3Al° 
WN a 

P = rae (9.17) 


where V = AI is the volume of the box. Now multiply and divide Eq. 9.17 
by 2 and obtain 
P= oy (5me) (9.18) 


which shows that the pressure of a gas on the walls of a container is pro- 
portional to the average kinetic energy of the atoms or molecules of the gas. 
One should recognize that atom-atom collisions also take place in a gas. In 
a more complete calculation these are considered, but the result given by Eq. 
9.18 remains unchanged. 


9.6 KINETIC THEORY OF TEMPERATURE 


We may now show the relation of molecular motion to temperature by using 
the ideal gas law, Eq. 9.6 


PV = nRT (9.6) 


and the equation we just derived for the pressure, Eq. 9.18. Substituting for 
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the pressure in Eq. 9.6 from Eq. 9.18, we obtain 

aN G m *) = nRT (9.19) 
in which it should be recalled that N is the number of molecules present in 


the box and n is the number or fraction of moles present. By definition 


N (number of molecules) 


n (number of moles) = Na (Avogadro’s number) 


Substituting n = N/N,g in Eq. 9.19 gives 


2 1 - N 
— = = — RT 
SN (5 me] N (9.20) 


The number of molecules, N, cancels, and we are left with the ratio of 
two constants R/N,. This ratio occurs so frequently that it is given the name 
Boltzmann’s constant, after the German theorist, with the symbol kg. Its value 
iS 
k == 
B Na 


_ 8314 J/kmol - K 
~ 6.02 x 1026 molecule/kmol 


1.38 x 10-73 J/K per molecule 


Eq. 9.20 then becomes 
; | 
mou = 5 kg T (9.21) 
or 
Z 
T= >— 9.22 
ae (9.22) 


and we have shown that temperature is simply proportional to the average 
kinetic energy E, of the molecules. Although this calculation has been done 
for gases, the same result is obtained for liquids and solids. 

We may use Eq. 9.21 to find the speed of molecules in a gas. Note, 
however, that the speed will actually be the square root of average squared 
velocity. This is called the root mean square (RMS) velocity and, although it is 
not strictly the average speed, its statistical definition is close enough for our 
purposes. Thus, from Eq. 9.21 


k 
UrRMs = f _ (9.23) 


where T is in K and m is the mass of a single molecule (or atom) in kilograms. 


Exam p le 9-4 


If we consider air to be made up largely of diatomic nitrogen molecules, N3, 
what is their RMS velocity at 27° C? One nitrogen atom has a mass of 14 x 
1.67 xX 10°?’ kg, and the mass of Nz is twice that. 


Solution From Eq. 9.23 


SkpT 3 x 1.38 x 10°* J/K x 300K 
Orms = Wo = Wooo ae TD ):—dSW=:«sOS5 m/sec 


m 28 x 1.67 X 10-2’ kg 


9.7 MEASUREMENT OF HEAT 


The measure of a quantity of heat AQ was established by French scientists 
as the calorie. One calorie is the quantity of heat required to raise the temperature of 
1 g of water by 1° C. (In the English system the measure is the British thermal 
unit (BTU), where 1 BTU is the quantity of heat required to raise the tem- 
perature of 1 lb of water by 1° F.) 

As we discussed in Chapter 5, friction between surfaces causes loss of 
mechanical energy. However, experience shows us that friction produces 
heat. Anyone who has used sandpaper on a wooden surface has observed 
this phenomenon. From the preceding section we recognize that this tem- 
perature rise is due to the increased kinetic energy of the molecules. This 
increase has been produced by the work done on the molecules by the sand- 
paper. So we see that by our understanding of the nature of temperature we 
need not restrict the law of energy conservation to frictionless systems; the 
apparent loss of mechanical energy of the moving system has gone into 
increased mechanical energy of the molecules. 

We may measure how much mechanical energy produces what quantity 
of heat by a simple experiment (see Fig. 9-5). Suppose we have a paddle 
wheel driven by a falling, weight. The paddle wheel is in a known quantity 
of water completely insulated from heat flowing in or out. By letting the 
weight fall with constant velocity and measuring the increased temperature 
of the water, we may find the mechanical energy equivalent of heat. (Note: 
The weight is allowed to fall at constant speed so that no changes in kinetic 
energy have to be considered, only changes in potential energy.) The quantity 
of heat AQ is proportional to the temperature rise. This involves the mass of 
the water m and the specific heat of the water c, which is defined as the amount 
of heat needed to raise the temperature of 1 g of a substance (in this case water) 
1° C. As mentioned earlier, for water this is 1 cal/g °C. Therefore, the quantity 
of heat AQ required to raise the temperature by AT of a mass m of a substance 
whose specific heat is c, is written as 


AQ = mcAT (9.24) 
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AQ = mc AT 


FIGURE 9-5 

Diagram of the apparatus for the 
measurement of the mechanical 
equivalent of heat. 
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We can equate this to the loss of potential energy, AE, = m’gh, of the falling 
weight m’g, that is, 


m'gh = mc AT (9.25) 


For a known amount of AE, and a measured AT, the experiment yields the 
relation 


4.184] = 1 cal (9.26) 


This relation is called the mechanical equivalent of heat. 

We may generalize Eq. 9.24 to any substance. On measurement we find, 
however, that almost all substances have different specific heats. These are 
not readily calculated but can be determined experimentally and are given 
by tables in handbooks. The evaluation of the specific heat (or heat capacity) 
of ideal gases is simpler and is presented next. | 


9.8 SPECIFIC HEATS OF GASES 


If we hold a quantity of gas at a constant volume so that it cannot do work 
by expanding, then all the heat AQ goes into increasing the kinetic energy 
of the molecules, or 


AQ = AE, (9.27) 


We have discussed c as the specific heat per gram of any substance, but 
actually we have been talking about solids and liquids. The situation is dif- 
ferent for gases because they are compressible. We may define a term C, as 
the molar specific heat at constant volume; that is, the specific heat per mole 


C, = cM (9.28) 


where M is the mass of a mole of gas and c, is the specific heat per gram at 
constant volume. The mass of the gas m is the mass of a mole M multiplied 
by the number of moles n, that is, m = Mn. We may rewrite Eq. 9.24 as 
AQ = (specific heat per mole) X (number of moles) x (AT), or 


AQ = Cn AT (9.29) 


The resulting increase in the energy of the molecules may be written as 


AE 
AE, = (number of molecules) x (—cha) 
per molecule 


The number of molecules is equal to the number of moles n, multiplied by 
Avogadro’s number Ng, and, from Eq. 9.21, AE; per molecule = 3/2 kg AT. 


4.184 J = 1 cal 


Therefore, 
3 
AE, = (nNa) (; kp ar) (9.30) 
Substituting Eq. 9.29 for AQ and Eq. 9.30 for AE; in Eq. 9.27, we obtain 
Cun AT = : nN akp AT 
or 


3 
oF = 5 Nake (9.31) 
ae R 
By definition kg = N,’ and Eq. 9.31 becomes 
A 


C, = = R (9.32) 


Note that the conversion factor of Eq. 9.26 will reduce R to a value somewhat 
easier to remember 


R 


1 cal 
8314 J/kmol-K = 8.31 - 
]/kmo 8.314 J/mol K( S53 3) 


cal 


1.987 


mol K = 2 cal/mol-K 


Therefore, Eq. 9.32 for ideal gases, which involve only the translational motion 
of their atoms (no vibration or rotation as in diatomic gases), yields 


cal 
mol-K 


C, = : x 2 cal/mole-K = 3 


This value is expected to hold for all the rare gases that are monatomic, such 
as helium, neon, and argon. Experiment has proven that the agreement with 
theory is excellent. 


9.9 WORK DONE BY A GAS 


Suppose we have a cylinder and a piston with a gas inside, as in Fig. 9-6. 
Let the cross section of the cylinder be A and the weight of the piston plus 
a weight resting on it be mg. Suppose the piston was originally at position 
h, and the gas has expanded and pushed it up a distance dx to position hp. 


WORK DONE BY A GAS 


rm | Go 
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The definition of work, Eq. 5.3, is force times the distance moved in the 
direction of the force. 


dW = F dx (5.3) 


But from the definition of pressure, P = F/A, we may substitute for F and 
obtain 


dW = PA dx 


Since A dx is the change in volume dV, we may write that work done by a 
gas as 


dW = PdV (9.33) 


Note that by the definition of work, if the gas does work by expanding, the 
work done is positive, whereas if the gas is compressed by the force on the 
piston, the work done by the gas is negative. 


9.10 FIRST LAW OF THERMODYNAMICS 


We now have three factors relating energy and the behavior of a gas: 


1. Work done on or by a gas AW = P AV. 
2. The quantity of heat AQ that may be added or extracted from the gas. 


3. The change in average kinetic energy of the molecules AF;, which we 
usually call the change in internal energy AE. 


Because of the interplay of these three terms, it is not meaningful to ask what 
is the amount of heat in a gas. If heat is added to a gas at constant volume, 
it all goes into the increase of the internal energy AE. If, however, the gas is 
allowed to expand and do work when heat is added, then the amount of heat 
available to increase the internal energy depends on the amount of heat that 
has gone into work. For example, if the gas is allowed to expand and do work 
while the temperature is held constant (AT = 0), then the final internal energy 
is the same as the initial and AE = 0. We may logically write these concepts 
in the form of the equation 


AQ =AW+AE (9.34) 


In Eq. 9.34 AQ is taken as positive if heat enters the system (the gas in this 
case) and as negative if it leaves the system. The work, AW, is positive if it 
is done by the system, and negative if done on the system. This simple, logical 
expression bears the ponderous name of the First Law of Thermodynamics. It 
is seen that through the expression for the mechanical equivalent of heat, 
Eq. 9.26, AQ can be expressed as energy, as can AW and AE. We have then 


dW = PdV 


AQ = AW + AE 


FIRST LAW OF THERMODYNAMICS 


a full statement of the law of conservation of energy in which the E,, (energy 
out) term of the mechanical law, Eq. 5.14, is now included as well as the 
energy term E;,, which may be heat. Furthermore, because work, AW, can 
give rise to changes both in potential and in kinetic energy of a body or system 
of bodies, all possible mechanical energy terms have been included. Other 
forms of energy, for example, radiant energy, which will be discussed in a 
later chapter, are also included in the first law of thermodynamics. 


Example 9-5 


Six thousand calories of heat are added to 2 moles of neon gas at 27° C while 
it does 4100 J of work. (a) How much does the internal energy of the system 
increase? (b) What is the final temperature of the gas? 


Solution 
(a) First convert calories to joules 


4.184 J 
1 cal 


AQ = 6000 cal ( ) = 2.51 x 10*J 


Use the first law of thermodynamics 
AE = AQ — AW 
= 2.51 x 10*J —0.41 x 10*J = 2.10 x 10*J 


(b) The relation between the change in the internal energy and the change 
in temperature is given by Eq. 9.30 


AE = (nNa) (5 kp a7) 


Recalling that Na kg = R, we write 


AE = oak AT 
2 
Therefore 
2 AE 
AT = 
3 nR 


: 2 x 2.10 x 10*J 
~ 3x 2mol X 8.314 J/mol-K 


842 K or °C 
Tginal = 27° C + 842°C = 869°C 
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SUPPLEMENT 9-1 


Maxwell-Boltzmann Statistical Distribution 


In this chapter we have derived the mean square velocity v? of a large number 
of atoms or electrons, Eq. 9.21. We did not address the question of how these 
square velocities are distributed. That is, how many atoms have a square 
velocity twice the mean or one half the mean. This problem is in the realm 
of statistical mechanics, the science of the application of the first principles of 
physics to a large number of bodies. The rigorous solution is beyond our 
interest and is too difficult to present here. However, Professor Richard Feyn- 
man has presented a conceptual solution that we will give. 

Suppose we have a very tall glass tube with a column of gas going to a ae _s” £8 
great height such as into our upper atmosphere—but, unlike our atmosphere, Ludwig Boltzmann (1844-1906). 
the temperature is the same at all heights. The problem will be to derive the 
law of the decrease in density of the atmosphere as the height increases. Let 
n be the number of moles of gas in volume V at pressure P. From the ideal 
gas law Eq. 9.6 we write P = nRT/V. We recall that R = kgN4z, therefore, 

P = (nNg/V) (kgT). The ratio nN,/V represents the number of molecules per 
unit volume N. Thus the pressure is proportional to the number of molecules 
per unit volume because the temperature is constant; that is, P = NkpT. 

The pressure is higher the lower we measure it, for at any point the gas 
must support all the gas above it. Consider a small cylindrical volume of gas 
of cross-sectional area A = 1 m* and width dy at a height y (see Fig 9-7). The 
vertical force from below on this gas is P,, because F = P,A and we have 
taken A = 1 m?. The vertical force from above at a height y + dy is less than 
P, by the weight of the molecules in the section between y and y + dy. The 
total number of molecules in this region is the number per unit volume N 
times the volume dy (since A was taken as unity). Each molecule has a weight 
of mg, so the difference in pressure is 


P 


y 


+ ay ~ Py = dP = —mgN dy. (9.35) 


We have seen that P = NkgI’. Because T is constant, the pressure depends 
only on N. Differentiating P with respect to N we obtain 


dP = kgT dN (9.36) 


Equating Eq. 9.36 to Eq. 9.35, we have an equation that can be integrated to 
find N 


ksT dN = —mg N dy 


Or 


FIGURE 9-7 


Integrating 


es 
ie N 7 aca A ay 
which yields 


N = No ew ™svkT (9.37) 


where the constant No is the value of N at y = 0 and this value of y may be 
at any predetermined level. 

We see that the numerator of the argument of the exponential in Eq. 9.37 
contains the potential energy per molecule, and we may write that the density 
at any point is 


N = No e> Evikor (9.38) 


The source of the potential energy need not be gravitational. For example, 
the potential energies of electrons at different distances from their nuclei have 
an electrical origin, as we will see in Chapter 14. 

In Chapter 5 we saw that as a particle moves in a gravitational field (or 
any other conservative field) the potential energy of the particle may change. 
However, the total energy, which is the sum of the potential and kinetic 
energies, remains constant. Thus it is reasonable to assume that if we start 
with a given number of molecules having a certain value for the potential 
energy, the number having that particular value for the total energy, sometime 
later, will be the same. This means that Eq. 9.38 can be generalized to represent 
the number of molecules having a certain value for the total energy E; that 
is, 


N = No ev Eker (9.39) 


Eq. 9.39 is known as the Maxwell-Boltzmann statistical distribution of energy. 

In general, if E, — Ez is some energy difference between energy state 1 
and a higher energy state 2 of a particle, the ratio of the number occupying 
the higher energy state to the lower energy state is given by 


N2 — —(E2— E1)/kpT 

N, € (9.40) 
We can use Eq. 9.39 to answer another important question, namely, what 

fraction of the total number of particles has energy equal to or greater than 

a certain value E;? Because Eq. 9.39 represents the number of particles with 

energy E, the number of particles with energy between E and E + dE will 

be proportional to Noe~ ***" dE. It follows that the fraction of particles with 


SUPPLEMENT 9-1 


N = No e E/kal 
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energies greater than or equal to EF; is 


[Uv ene ae 


Fraction (E = E;) = 


e~ E/kpT 
E; 


e~ E/kpT 


en Ei/kpT 


dE 
kpT 
dE 
kT 


i No e7 ET dE 


Seem (ia) 
[rewmr( 


Fraction (E = E,;) 


(9.41) 


The term of Eq. 9.41 is often referred to as the Boltzmann factor. 


PROBLEMS 


9.1 The molecular weights of sodium and chlorine 

are 22.99 g/mole and 35.45 g/mole, respectively. How 
many molecules of sodium chloride (ordinary salt) 
are there in 100 g of salt? 


9.2 The density of copper is 9 g/cm?, its molecular 
weight is 64 g/mole. What is the number of copper 
atoms in 1 m*? 

(Answer: 8.47 x 1078.) 


9.3 (a) What is the body temperature of a healthy 
person on a Celsius clinical thermometer? (b) If the 
person had a temperature of 102° F what would the 
thermometer read? 


9,4 At what temperature will Fahrenheit and Cel- 
sius thermometers read the same value? 
(Answer: — 40°.) 


9.5 A gas bubble rises from the bottom of a lake to 
the surface. If the pressure at the bottom is three 
times atmospheric pressure and the temperature is 


4° C while near the surface the temperature is 24° C, 
what is the ratio of the volume of the bubble just 
before it reaches the surface to its volume at the 
bottom of the lake? 


9.6 In an ultrahigh vacuum system, the pressure 
can be lowered to 107° torr (1 torr = 1/760 atm). 
How many molecules are there in a vacuum chamber 
of volume 8 xX 10° cm? if the pressure is 107 ?° torr 
and the temperature is 27° C? 1 atm = 1.01 x 10° 


2 
N/m/. (Answer: 2.57 x 1013.) 


9.7 A gas tank contains 10 kg of oxygen at a pres- 
sure of 10’ N/m? and a temperature of 27° C. As a 
result of a leak, the pressure drops to 5 x 10° N/m? 
and the temperature decreases to 7° C. (a) What is 
the volume of the tank? (b) How much oxygen has 
leaked out? The molecular weight of oxygen is 
32 g/mole. 


9.8 The interior of the sun is at a temperature of 


about 1.5 x 10® K. The energy is created by the 
fusion of hydrogen atoms when they collide. In de- 
veloping the technology for a fusion reactor we sim- 
ulate this fusion reaction by accelerating protons 
(hydrogen nuclei) and letting them strike fixed-tar- 
get hydrogen atoms. What must be the velocity of 
the protons to simulate 1.5 x 10% K? 


9.9 One gram of Ne gas (atomic weight 20.2 g/mole) 
is in a sealed flask at room temperature, 27° C. If 10 
calories of heat are added to the gas, what is the 
Urms Of the molecules? 

(Answer: 6.72 X 10% m/sec.) 


9.19 The temperature of a gas in a closed container 
at 27° C is raised to 327° C. By what multiple has 
Vrms Changed? 


9.11 The temperature of aroom 7m X 5m X 3m 
is 27° C. (a) How much energy is contained in the 
air of that room? (b) If that energy could be converted 
to electrical energy, for how long could a 100-W bulb 
be lit? Assume the air behaves as an ideal gas. 


9.12 A 300-W immersion heater is used to heat a 
cup of water. If the cup contains 150 g of water at 
27° C and 80% of the heater energy is absorbed by 
the water, how long will it take for the water to begin 
to boil? 

(Answer: 191 sec.) 


9.13 A 1000-W heater is used to heat the room of 
problem 9.11. If the molar specific heat of air is 
5 cal/mole - K, how long will it take to raise the tem- 
perature of the room from 60° F to 70° F? Assume 
that the quantity, volume, and pressure of air do 
not change. 


9.14 How many calories of heat must be added to 
0.5 g of neon gas at constant volume to raise its 
temperature from 27° C to 127° C? The molecular 
weight of neon is 20.2 g/mole. 

(Answer: 7.40 cal.) 


9.15 To heat a certain quantity of gas from 27° C 
to 127° C requires 500 cal when its volume is kept 
constant. By how much does its internal energy 
change? How much work could the gas do in cooling 
back to 27° C? 
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9.16 A 20-g bullet is shot into a ballistic pendulum 
with a velocity of 1000 m/sec (see Fig. 9-8). The mass 
of the wooden block is 2 kg. If the bullet remains 
embedded in the block and 80% of the energy lost 
in the collision is absorbed as heat by the bullet, 
what is the increase in the temperature of the bullet? 
The specific heat of the bullet is 0.1 cal/g-°C. 
(Answer: 946° C.) 


=m ae! 


FIGURE 9-8 
Problem 9.16. 


9.17 Ten moles of an ideal gas at a pressure of 8 
atm and with volume 10~? m® are allowed to expand 
isothermally (constant temperature) until the vol- 
ume doubles. What is the work done by the gas? 
(1 atm = 1.01 x 10° N/m’). 

(Answer: 5.6 X 10° J.) 


9.18 Use the ideal gas law, Eq. 9.6, and the first 
law of thermodynamics, Eq. 9.34, to show that the 
molar specific heat for a process at constant pressure 
C, = C, + R, where C, is the molar specific heat at 
constant volume and R is the universal gas constant. 


9.19 The initial pressure and volume of 0.1 moles 
of argon gas are 1 atm (1.01 x 10° N/m’) and 1 liter 
(10~° m?) (see Fig. 9-9). The gas is heated at constant 
volume until the pressure rises to 4 atm (path A). 
The gas is then allowed to expand along path B until 
the pressure drops to 1 atm. The gas is finally cooled 
down at constant pressure until it returns to its initial 
state (path C). (a) Find the temperature of the gas 
at the end of each process (points 1, 2, and 3). (b) 
Find the internal energy of gas at points 1, 2, and 
3. (c) Calculate the work done in each process. (d) 
Calculate the heat entering or leaving the gas during 
each process. 
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(Answer: (a) 122 K, 487 K, 609 K, (b) 152 J, 607 J, 
760 J, (c) OJ, 1.01 x 10° J, —4.04 x 107 J, 
(d) 455 J, 1163 J, —1012 J (leaving the gas).) 


7 7 
a 
i" 
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Volume (liters) -—— 


FIGURE 9-9 
Problem 9.19. 


9.20 A mole of an ideal gas is taken from state A 
to state C along the path ABC (see Fig. 9-10). (a) If 
1000 cal of heat flow into the gas and the gas does 
2100 J of work, what is the change in the internal 
energy of the gas? (b) When the gas is returned from 


C to A along the path CDA, 700 cal of heat flow out 
of the gas. How much work is done on the gas? 
(c) What is the change in the temperature of the gas 
when it is brought back from C to A? (d) If the 
pressure of the gas in state A is 2 atm, what is the 
difference in the volume of the gas between states 
D and A? 


Pressure —— 


FIGURE 9-10 
Problem 9.20. 


Volume -—— 
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10.1 INTRODUCTION 


In this chapter we will develop the concepts of oscillatory motion. When a 
block attached to a spring is set into motion, its position is a periodic function 
of time. Similarly, in Chapter 7 when we considered the motion of a particle 
rotating in a circle, we saw that the position coordinates were oscillatory 
functions of time. Specifically, we showed (Fig. 10-1a) that the components 
of a position vector r making an angle 6 with the x axis were 


x = rcos 6 
(10.1) 
y =rsin 6 
and that the components of the acceleration, second derivatives with respect 
to time of these coordinates, were (Fig. 10-1b) 


d2 
a, = - = —rw* cos 0 
- (10.2) 
_ 4y _ ; 
ay = ae —rw* sin 0 


As the angle 0 goes from 0° to 360° the components of the position vector 
and of the acceleration vary in value as the sine or cosine functions and go 
through a reversal of sign; in other words, their values oscillate sinusoidally. 
In this chapter we will derive some properties of this oscillatory motion with 
essentially the identical mathematical method that generated Eqs. 10.1 and 
10.2. 


10.2 CHARACTERIZATION OF SPRINGS 


Robert Hooke (1635-1703), an English scientist, was the first to elucidate the 
behavior of an elastic body such as a spring. He found that the extension or 
compression x of an elastic body is proportional to the applied force F or 


Fax 


This simple relationship is now known as Hooke’s law. We introduce a pro- 
portionality constant to create an equality. This constant has the symbol k 
and is called the force constant or spring constant 


F=k (10.3) 


In the case of a spring, the value of the constant k characterizes the strength 
(or stiffness) of the spring—a spring with a large k is stronger, or stiffer, than 
one with a small k. We may readily measure the value of k by simply hanging 
a weight on the spring and measuring how much it stretches. For the mea- 
surement to be valid, the spring must return to its original length when the 


(2) 


(0) 
FIGURE 10-1 


weight is removed. The extension of the spring in this case is within its elastic 
limit. If a spring is stretched beyond its elastic limit it will deform permanently 
and Hooke’s law is not obeyed. 


10.3 FREQUENCY AND PERIOD 


Suppose we have a periodic event, that is, one that occurs regularly with 
time such as the rising of the sun. We know that it occurs once each day; 
that is, its frequency v is one event per day, and it has dimensions of (time) ~* 
because event is dimensionless. We use another quantity, the time between 
periodic events, known as the period with symbol T. The period of the sun’s 
rising is 1 day per event, and obviously has dimensions of time. It is seen 
that period and frequency are reciprocals of each other. 


p= (10.4) 


If, for example, the time between periodic events is T = 0.2 sec, then the 
number of events per second, the frequency »v, will be 


1 


yp = ———— = 5 events/sec 
0.2 sec/event 


As we have indicated, frequency has units of (time)~' or events per second. 
One event per second is called one hertz, abbreviated Hz. 

If a point particle moves on a circle as in Fig. 10-1a, its position vector 
from the center of the circle to the particle has the magnitude of the radius. 
This is often called the radius vector of the point, or simply the radius vector. 
As the particle moves, the radius vector rotates and if the particle moves with 
constant speed in the counterclockwise direction, we say that it rotates with 
a constant positive rotational speed w. Then 6 is a function of time, and from 
Eq. 7.8 


6 = aot (10.5) 


We can use this result to express the coordinates x and y of the rotating 
particle in Fig. 10-1 as explicit functions of time and the frequency of rotation. 
Substituting Eq. 10.5 for 6 in Eq. 10.1, we obtain 


r cos wt 


x 


y 


The angular speed w is related to the frequency of rotation v rather simply. 
In every rotation 6 changes by 27 rad. If the particle performs v rotations in 
1 sec, then 6 will change by 27v rad every second. By definition, w is the 
change in @ per unit time (per second). We conclude that 


@ = 2iv (10.7) 


(10.6) 


r sin wt 
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Substituting Eq. 10.7 for w in Eq. 10.6 we may write 


y= 7 — 27vt (10.8) 
y = r sin 27vt 


10.4 AMPLITUDE AND PHASE ANGLE 


Fig. 10-2 shows a plot of sin 0 versus 0. We see that the value of sin 60 oscillates 
between + 1 and —1. The maximum value of the magnitude of this oscillation 
is called the amplitude. In Fig. 10-2, the amplitude is 1. If sin 6 were multiplied 
by a constant A, then A would be the amplitude in the expression A sin 6. 
Suppose instead of sin 0 we plot the function sin(@ + 7/4). We see that when 
6 = 0 the function has the value of sin 7/4 and thereafter attains all values 
of sin 6 at an angle 7/4 earlier, as shown in Fig. 10-3a. If, on the other hand, 
we plot the function sin (6 — 7/4) we see in Fig. 10-3b that it starts later than 
the sin 6 function. The general form for a function to describe a body undergo- 
ing sinusoidal oscillations, such as the one illustrated in Fig. 10-1 is 


A sin (0 + @) 
or, because 0 = wt, this may be written as 
A sin (wt + ¢) (10.9) 


where @ is called the phase angle and its sign may be positive or negative. 
Note that if @ = 7/2, then sin (6 + 7/2) = cos 6, which may be seen by 
sketching a + 7/2 phase shift on Fig. 10-32. The motion described by Eq. 10.9 
is often referred to as simple harmonic motion. 


10.5 OSCILLATION OF A SPRING 


Suppose a body of mass m is connected to a massless spring, with a spring 
constant k, and the body is free to oscillate on a frictionless surface as in Fig. 
10-4. At its rest, or equilibrium: position, the position coordinate is x = 0, 
indicated in Fig. 10-4a. If the body is pushed to compress the spring a distance 
Xo, Fig. 10-4b, or pulled to stretch it a distance xo, Fig. 10-4c, and then released, 
the body will then begin to oscillate. We may calculate its subsequent motion 
from Newton’s second law 


F = ma 


Eq. 10.3 gives an expression of Hooke’s law, F = kx, for a spring. However, 
this is the external force needed to compress or to stretch the spring. By 
Newton’s third law of action and reaction, if you pull on a spring with force 
F it pulls in the opposite direction with force —F. Thus, the force that the 
spring exerts on the body is —kx. Because the acceleration is not constant 
(the force on the body depends on displacement x from equilibrium), we use 


FIGURE 10-2 


(6) 
FIGURE 10-3 


the fundamental definition a, = d*x/dt?. Applying Newton’s second law to 
the body, we obtain 
ke = m 10.10 
x =m 12 (10.10) 
This is a second-order differential equation; although there are straightforward 
mathematical techniques for its solution, we will simply guess a solution and 
substitute it into Eq. 10.10 to see if an equality is maintained. Such a procedure 
can verify that the guessed function is a solution but does not prove that it 
is the only solution. We may try as our guess the function introduced earlier, 
Eq. 10.9, to describe a body undergoing sinusoidal oscillations. Our guess at 
a solution will be 


x = Asin (wt + 6) (10.9) 


We may substitute this directly into the left side of Eq. 10.10, but for the right 
side we need its second derivative 


= = A~ sin (wt + ) = Aw cos (wt + ¢) 
AG eoethe SAS enGEs oe aoa 
dt? dt 
Substituting Eqs. 10.9 and 10.11 into Eq. 10.10 obtains 
—kA sin (wt + ¢) = —mwo’ A sin (wt + ¢) 
Cancelling obtains 
k = mo? 
and 
k 
w = ue (10.12) 


Therefore, Eq. 10.9 is a solution when the constants have the relation of Eq. 
10.12. Using Eq. 10.7, that w = 27, we immediately obtain the frequency of 
oscillation 


p= = (10.13) 


and the period 


1 m 
. p “ k 


To complete the solution of the problem, we must determine the value 
of the amplitude A and of the phase angle ¢ in the expression for x, Eq. 10.9. 
This is done by specifying the boundary conditions, that is, the behavior of the 
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(a) 


(c) 
FIGURE 10-4 


x = Asin (wt 4 @) 
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body at some time such as, t = 0. For example, the body in Fig. 10-4 can be 
set into oscillation by initially stretching the spring a certain distance x = xp 
as shown in Fig. 10-4c and then releasing it. That is, att = 0. 


X = Xp (10.14) 
Vv, = 0 (10.15) 


The first condition, Eq. 10.14, is satisfied by setting x = x) and t = 0 in Eq. 
10.9. This yields 


Xo = Asin gd (10.16) 


To impose the second condition, we must first determine the velocity of the 
body as a function of time. This is done as follows: 


dx ad 
Vx ra? A sin (wt + ¢) 
Vv, = Aw cos (wt + ¢) (10.17) 


The second condition, Eq. 10.15, will be satisfied by setting v, = 0 and 
t = 0 in Eq. 10.17; that is, 


0 = Awcos d (10.18) 


The amplitude and the phrase angle can now be found by solving simulta- 
neously Eqs. 10.16 and 10.18. If we divide Eq. 10.18 by Eq. 10.16, we obtain 


0 7 Aw cos & 
Xo A sin @ 
or 
| cot d = 0 
hence 
T 
od = 5 (10.19) 


Substituting Eq. 10.19 for @ in Eq. 10.16 yields the result 
_ 7 
Xo» = Asin rie A (10.20) 


Thus, we see that the amplitude, in this case, is equal to the initial displace- 
ment of the body from its equilibrium position, and the phase angle is 
m/2 rad. We should note that other boundary conditions will yield different 
values for A and @. 

We can use the facts that sin (6 + w/2) = cos @ and cos (6 + 7/2) = 
—sin 9, to eliminate ¢ from the expressions Eq. 10.9 for x and Eq. 10.17 for 


V,, Which now become 


x = Acos wt (10.21) 


v, = —Ao sin ot (10.22) 


We see by Eq. 10.22 that immediately after release from its stretched 
position to the right, the velocity of the body is toward the left, hence the 
negative sign. When the argument of the sine, wt, exceeds 7, then the sine 
function becomes negative and the velocity is positive, or toward the right. 
We also see from Eq. 10.22 that because the maximum value the sine function 
may have is +1, then the maximum velocity of the block is 


k 
Umax = tAw = +A aE (10.23) 


Furthermore, because sin wt = 1 when wt = w/2 and —1 when wt = 37/2, 
insertion of these values into Eq. 10.21 shows that the maximum velocity 
occurs when x = 0 or at the midpoint of oscillation. It is instructive to compare 
a plot of Eq. 10.22 with one of Eq. 10.21 as shown in Fig. 10-5. Note that the 
amplitude of the displacement A and the maximum value of the velocity Aw 
are not the same because w may be equal to or greater or smaller than unity. 
Figure 10-5 is a plot with Aw ~ 1.2A. It can be seen that the velocity is 
maximum when the displacement is zero and zero when the displacement is 
maximum. The physical significance of this result will become evident when 
we discuss the energy associated with oscillations in the next section. 

We may examine the behavior of the acceleration by taking the time 
derivative of the velocity in Eq. 10.22. 


ee Awe sine 
=-— = —Aw— 
- dt dt 
a, = —Aw* cos wt (10.24) 


We plot a, versus 6 and compare it with the displacement (x value). Because 
w for Fig. 10-5 was taken as 1.2, w? is 1.4, so the maximum value of a, will 
be 1.4 times the amplitude of x. The acceleration and displacement curves 
are plotted in Fig. 10-6. Here we see that the acceleration, although also a 
cosine curve but with a different amplitude, is a reflection about the 6 axis 
of the displacement. That is, when the displacement is maximum in the 
positive direction, the acceleration is maximum in the negative direction. 
Furthermore, when the displacement is zero, so is the acceleration. This 
relation can be seen physically in Fig. 10-7. When the body is displaced to 
the right (positive x direction) a distance A and then released (Fig. 10-72), the 
acceleration is.in the negative x direction. The acceleration is also a maximum 
at this position. This is evident because F = ma and F = —kx. Therefore, 
—kx = ma and a is maximum when x is maximum and x and a have opposite 
signs. When the body reaches x = 0 (Fig. 10-7b), the acceleration is 0, but as 
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FIGURE 10-5 
Plot of x and v as functions of 
time for w greater than unity. 


= +Aw 


i 
Umax 


FIGURE 10-6 
Plot of x and a as functions of 
time for w greater than unity. 
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it passes to the left the displacement x becomes negative and the acceleration 
becomes positive or toward the right (Fig. 10-7c). This behavior is shown 
analytically in Fig. 10-6. At 6 = 7/2, which corresponds to zero displacement, 
the acceleration goes to zero. As the displacement becomes negative 
(7/2 < 6 < 37/2) the acceleration becomes positive. We also note from Fig. —_—»> 
10-6 that the two maxima of acceleration occur at 6 = 0 or 7 (wt = 0 or 7). 
The maximum values of the acceleration at these two positions are, from Eq. 
10.24, 


a, = —Aw* cos0 = —Aa? ; a=0 
a, = —Aw® cos 7 = Aw : i 
or w) om 
Amax = + Aw (10.25) | 


— 


Example 10-1 ae 
rece} a 
Show that x = A cos (wt + ¢) is also a solution of Eq. 10.10. asa a 


t 


Solution <> 
d*x (c) 
—kx =m, (10.10) FIGURE 10-7 
Take the second derivative and substitute into the right side of Eq. 10.10. 
dx d : 
ra A a cos (wt + ¢) = —Aw sin (wt + ¢) | Anaxr = tAw? 
d*x d 
ee, ee. es a a + ae 2 ae 
1 Aw Tx sin (wt + ¢) Aw* cos (wt + ¢) 
Substituting in Eq. 10.10, 
—kA cos (wt + ¢) = —mAo?’ cos (wt + ¢@) 


After cancelling the cosine term from both sides of this equation, we obtain 


aE 
w= 14/— 
Mm 


Thus, A cos (wt + @) is a solution of Eq. 10.10 for this value of w. 


Example 10-2 


A given spring stretches 0.1 m when a force of 20 N pulls on it. A 2-kg block 
attached to it on a frictionless surface as in Fig. 10-4 is pulled to the right 
0.2 m and released. (a) What is the frequency of oscillation of the block? (b) 
What is its velocity at the midpoint? (c) What is its acceleration at either end? 
(d) What are the velocity and acceleration when x = 0.12 m, on the block’s 
first passing this point? 


OSCILLATION OF A SPRING 
Solution First we must determine the spring constant k. 
F 20N 
k = — = —— = 200 N/m 
x 0.1 


(a) We may then calculate w from Eq. 10.12 


2 
oO = is - oN BEES = 10 rad/sec 


Because 


w = 27V 


ya &. q Wtadisee _ i 6 He 
27 27 


(b) The velocity is a maximum when x = 0, that is, at the midpoint. 
Therefore, from Eq. 10.23, (recall, as shown earlier in this section, 
that when a block is initially displaced a distance x9 from its equi- 
librium position and then released, the amplitude of the motion 
A = Xp) 


V = Umax = +Aw = +(0.2m) (10 rad/sec) = +2 m/sec 
(c) The acceleration is a maximum at the two extremes of the motion. 
Therefore, from Eq. 10.25 
= +A? = +(0.2m) (10 rad/sec)? = +20 m/sec? 


Amax 


(d) To determine the block’s velocity and acceleration at some arbitrary 
value of x, we need to know the angle wt at that position. In this 
problem, x = 0.12 m. We use the relation 


x = Acos at (10.21) 
= 55° 


C . arc Cos Wat 
sae 
arc COS | oN 


wt 


Then we may substitute into Eqs. 10.22 and 10.24 


v = —Ao sin wt | (10.22) 
= —(0.2m)(10 rad/sec) sin 53° = —1.6 m/sec 
(Moving toward the left) 
a = —Aw’ cos ot (10.24) 


= —(0.2m)(10 rad/sec?)* cos 53° = —12 m/sec* 
(Accelerating toward the left) 
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10.6 ENERGY OF OSCILLATION 


In Chapter 5 we saw that when an object is raised to a height y in the 
gravitational field on its descent the gravitational force is capable of doing 
work on the object. Because of this, there is associated with the object at a 
height y a potential energy E, = mgy. We defined the potential energy as the 
work done in raising the object to that height. An analogous situation occurs 
here. When a body attached to a spring is displaced from its equilibrium 
position (x = 0), the spring is potentially capable, on the release of the body, 
to do work on the body. We can therefore associate with the spring-body 
system a potential energy E,. This potential energy will be the work done in 
stretching or compressing the spring. 

When the force F and the displacement dx are in the same direction, 
work was defined as the product of the magnitudes of the force and the 
displacement (see Eq. 5.1), that is, 


dW = F ax 
or 


W = |’ Fax 
0 


From Hooke’s law, Eq. 10.3, the force needed to compress or stretch a spring 
is F = kx; thus 


W=k | xdx 


1 
W = 5 kx 


The potential energy of the spring-body system, when the body is displaced 
a distance x from its equilibrium position, is therefore 


1 
E, (spring) = 5 kx (10.26) 


This equation was derived on the assumption that the spring was initially in 
its equilibrium position, x = 0. This assumption is not necessary. If the spring 
is initially in a position x; and is compressed or stretched to position x2, the 
work done is as before 


W=k | xdx 


1 1 

= 5 kx = 5 kx (10.27) 
Note that because the displacement is squared the potential energy of a spring 
is the same whether it is stretched or compressed an equal distance x from 
its relaxed position. 


: ' Dna 
E, (spring) = skx° 


If there is no friction we may expect, as was the case with the gravitational 
force, that the total mechanical energy, kinetic plus potential, will remain 
constant as the body oscillates. This can be shown rather simply. By the work- 
energy theorem, Eq. 5.9, the work done by the spring, as the body moves 
between two arbitrary displacements x; and x2, is equal to the change in the 
kinetic energy of the body; that is, 


2 1 1 
i) "Faring dx = 5 mo} — 5 moi (10.28) 


where v; and v, are the velocities of the body at x, and x2, respectively. The - 


force exerted by the spring on the body is Fy,.ing = —kx. Substituting this for 
Fspring in Eq. 10.28, and integrating the left side of the equation we obtain 


z 1 1 
7M dee = (563 -3 bi) 


Eq. 10.28 becomes 


1 1 1 1 
~ (5 — 5H) = 5 Inv — 5 vi 


Rearranging terms, we obtain 


1 1 1 1 
5 kext + 5 MI = 5 kx3 + 5 mvs (10.29) 


Because x, and x2 are arbitrary points we conclude that the total energy 


1 1 
ae = ED (; ke) “yr Ex (; me?) 


remains constant as the body oscillates. 

Note that when the spring is stretched, x = A. Before the object is released 
it has no velocity and 1/2 kA? is the total energy of the system; after it is 
released the energy remains constant because energy neither enters nor leaves 
the system. 

In Section 10.5 we saw that the velocity was a maximum when the dis- 
placement was zero, and it was zero when the displacement was a maximum. 
This result is intimately tied to the fact that the total mechanical energy of 
the system remains constant. Therefore, the kinetic energy (and hence the 
velocity) will be a maximum when the potential energy is a minimum, that 
is, when x is zero. The kinetic energy will be a minimum (zero) when the 
potential energy is a maximum, that is, when x = A. 


Example 10-3 


The block of Example 10-2 is released from a position of x; = A = 0.2 mas 
before. (a) What is its velocity at x. = 0.1 m? (b) What is its acceleration at 
this position? 


ENERGY OF OSCILLATION 


; ha ae = 
E total _ 5 x T > moe 
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Solution 


(a) The velocity at x2 can be found with the conservation of energy 


equation, Eq. 10.29 
1 1 


1 1 
— kxt + = moj = = kxé + = mvs 


2 yi 2 


Solving for v2, noting that v; = 0, we obtain 


ae _ ay 
On = oe 


m 


_ 22° N/m [(0.2m)? — (0.1 my" 


— 10 m/sec? 


2 kg 
= 1.73 m/sec 
(b) We may find the acceleration at this position by using Newton’s 
second law 
F = ma 
—kx = ma 
A _ kx _ (200 N/m)(0.1 m) _ 
m 2 kg 
PROBLEMS 


10.1 Show that x = A sin wt + B cos wt, where A 
and B are arbitrary constants, is a solution of Eq. 
10.10. 


10.2. The position of a particle undergoing oscilla- 
tions is given by x = 25 sin (3at + a/5), where x is 
in centimeters and ¢t in seconds. Find (a) the fre- 
quency of the motion, (b) the amplitude of the mo- 
tion, (c) the maximum velocity of the particle, (d) 
the maximum value of the acceleration of the par- 
ticle, (e) the position, the velocity, and the acceler- 
ation of the particle at t = 0. 


10.3. The same block on the same spring as in Ex- 
ample 10-2 is released after being pulled 0.2 m to 
the right. Find its position, velocity, and acceleration 
0.1 sec after being released. 


10.4 Asmall block attached to a spring is oscillating 
horizontally on a frictionless surface with an ampli- 


tude of 0.12 m. When it is at the position x = 0.05 m 
its velocity is 2 m/sec. (a) What is its frequency of 
oscillation? (b) What is its position when its velocity 
is 1 m/sec? 


(Answer: (a) 2.92 Hz, (b) 0.107 m.) 


10.5 An oscillating block of mass 250 g takes 
0.15 sec to move between the endpoints of the mo- 
tion, which are 40 cm apart. (a) What is the fre- 
quency of the motion? (b) What is the amplitude of 
the motion? (c) What is the force constant of the 
spring? 


10.6 When a mass of 0.2 kg is suspended from a 
spring, it stretches 0.04 m. The mass is pulled 
down an additional distance 0.1 m from its equilib- 
rium position and released. (a) What is the spring 
constant? (b) What is the period of oscillation? 


(c) What is the frequency of oscillation? (d) What 
will be the maximum velocity? 

(Answer: (a) 49 N/m, (b) 0.40 sec, (c) 2.5 Hz, 

(d) 1.57 m/sec.) 


10.7. (a) Write down the equation for the position 
y (measured from the equilibrium position) of the 
mass in problem 10-6. (b) What is the equation for 
the velocity of the mass as a function of time? 
(c) What is the equation for the acceleration of the 
mass as a function of time? Take positions below the 
equilibrium point as positive. 


10.8 (a) How long after being released is the po- 

sition of the mass in problems 10-6 and 10-7 equal 

to 0.05 m? (b) What is the velocity of the mass when 

y = 0.05 m? (c) What is the acceleration of the mass 
when y = 0.05 m? | 

(Answer: (a) 6.7 X 10~2 sec, (b) —1.36 m/sec, 

(c) —12.25 m/sec’.) 


10.9 The position of an oscillating particle is given 
by x = Asin (wt + ¢), Eq. 10.9. A particle of mass 
m = 0.5 kg is connected to a spring of force constant 
k = 200 N/m. The particle is initially at rest on a 
frictionless table. The particle is given an initial ve- 
locity of 1.5 m/sec to start oscillating. What is the 
amplitude of the motion A and the phase angle ¢? 

(Answer: 7.5 X 107? m, 0 rad.) 


10.10 Two springs with force constants k, = 
100 N/m and kz = 200 N/m are connected to opposite 
ends of a block of mass 3 kg (see Fig. 10-8). (a) If 
the block is displaced 0.1 m to the right, what is the 
net force exerted by the springs on the block? The 
block is released from that position. (b) What are the 
frequency and the period of the motion? (c) What is 
the amplitude of the motion? (d) Find an expression 
for the position of the particle as a function of time? 


10.11 A block of mass 2 kg sits on a platform that 
is oscillating in a vertical plane with an amplitude 
of 10 cm (see Fig. 10-9). If the frequency of oscillation 
is 1 Hz, what is the normal force exerted by the 


FIGURE 10-8 
Problem 10.10. 
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platform on the block (a) as they pass the equilibrium 

point, (b) at the lowest point of the motion, (c) at 

the highest point of the motion? (d) If the frequency 

remains constant, at what value of the amplitude 
will the block and the platform separate? 

(Answer: (a) 19.6 N, (b) 27.5 N, (c) 11.7 N, 

(d) 24.8 x 107? m.) 


- i ne 
| | f) 
S|) 4 Ho 

| € 
| og 
| | 
— 
| § 
| = 
FIGURE 10-9 —_—_ | Ss 


Problem 10.11. 


Sie 


10.12. A block is oscillating horizontally on a tic 
tionless table with an amplitude of 5 cm. A coin of 
mass m = 3 g is placed on top of the block. The 
maximum force of friction between the coin and the 
block is 0.015 N. What is the maximum value of the 
frequency of oscillation for which the coin will stay 
on top of the block? 

(Answer: 1.59 Hz.) 


10.13 A block is oscillating with an amplitude of 
20 cm. The spring constant is 150 N/m. (a) What is 
the energy of the system? (b) When the displacement 
is 5 cm, what is the kinetic energy of the block and 
the potential energy of the spring? 


10.14 A0.25-kg mass is oscillating with a frequency 
vy = 5 Hz. What is the amplitude of the motion if 
the energy of the system is 12 J? 


10.15 A mass is oscillating with amplitude A. 
(a) When the displacement is x = 3A, what fraction 
of the energy is potential and what fraction is ki- 
netic? (b) For what value of x in terms of A will the 
energy be half kinetic and half potential? 

(Answer: (a) 4, #, (b) 0.707A.) 


10.16 A wooden block of mass 0.8 kg rests on a 
frictionless table connected to a spring (k = 200 N/m) 
as shown in Fig. 10-10. A 20-g bullet moving with 
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a velocity v = 500 m/sec is shot into the block and 
remains embedded in it. What is the amplitude of 
the ensuing oscillatory motion? 

(Answer: 0.78 m.) 


M 0.8 ke 


FIGURE 10-10 ‘ron 000655 
Problem 10.16. 2 ===" S=" 


10.17. A block of mass m, = 3 kg rests on a fric- 
tionless surface connected toa spring (k = 150 N/m). 
A second block of mass m2 = 1kgis launched toward 
m, with a velocity of 4 m/sec (see Fig. 10-11). After 
the collision, m2 bounces back in the opposite di- 
rection with a velocity of 1 m/sec. (a) How much will 
the spring be compressed? (b) What fraction of the 
energy is lost in the collision? 

(Answer: (a) 0.24 m, (b) 0.42.) 


FIGURE 10-117 
Problem 10.17. 


10.18 A spring (k = 200 N/m) is compressed 10 cm 
between two blocks of mass m, = 1.5 kg and m, = 
4.5 kg (see Fig. 10-12). The spring is not connected 
to the blocks, and the table is frictionless. What are 
the velocities of the blocks after they are released 
and lose contact with the spring? Assume that the 
spring falls straight down to the table. 


FIGURE 10-12 
Problem 10.18. 


10.19 A mass of 3 kg is connected to a spring of 
force constant 250 N/m on a horizontal surface. The 
coefficient of friction between the block and the sur- 
face is 0.1. The block is pulled 20 cm to the right 
and released. (a) How far to the left of the equilib- 
rium point will the block move? (b) What is the total 
back and forth distance traveled by the block before 
it stops? 

(Answer: (a) 0.176 m, (b) 1.70 m.) 
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11.1 INTRODUCTION 


Waves are an important concept in physics. We can see water waves and 
readily demonstrate sound waves with elementary laboratory experiments. 
In 1801 Thomas Young showed that light can be considered waves by ex- 
perimental analogies to the behavior of water waves. It will be shown in a 
later chapter that experiments with fundamental particles, such as electrons, 
demonstrate that they also have wave characteristics. 

If we crack a whip, we produce a brief transverse displacement that can 
be seen to travel to the end of the whip. If we drop a pebble on the calm 
surface of a pond, a circular ripple is produced. This ripple travels away from 
the point where the pebble hit the water with constant speed and, as it reaches 
a given point of the water surface, it produces a temporary transverse dis- 
placement of the water molecules. These are examples of traveling waves, 
which can transmit energy along a medium without any net translation of 
the particles in the medium through which the wave travels. These readily 
visible experiments with water or strings led early scientists to conclude that 
a wave is a disturbance that travels in a medium. When light was shown to 
have wave characteristics, even though it can travel in the vacuum of space 
that exists between the earth and the stars, an erroneous conclusion was 
drawn that there must exist a medium permeating the entire universe. This 
was called aether. As we will see in Chapter 16, because of the nature of the 
light wave, no medium is necessary for its propagation. Before we enter into 
the realm of modern physics, we must have a solid understanding of wave 
motion: the mathematical description of a wave, the parameters that char- 
acterize it, the laws that govern its propagation. In the development of these 
ideas we will consider waves in a visible medium such as a string or a water 
surface. 


11.2 WAVELENGTH, VELOCITY, 
FREQUENCY, AND AMPLITUDE 


Suppose we are sitting in a boat on a body of water in which there is some 
wave motion, as in Fig. 11-1. If we measure the time between risings on the 
waves, we have a quantity known as the period, with symbol T, which is the 
time between successive risings. If instead, we ask how many risings did we 
experience per unit time, such as 1 h, this quantity is called the frequency, 
with symbol v. Here vy = number of risings/unit time. Just as in the case of 
oscillatory motion considered in Chapter 10, the period and the frequency 
are reciprocals of each other. 


p= (11.1) 


If we ask a friend in another boat to row away from us to a location such 
that he rises at the same instant that we do, the distance between us is called 


FIGURE 11-1 
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the wavelength of the wave with symbol A. The speed of the wave through 
the water is the distance between the boats divided by the time it takes for 
our rising to reach him. This is called the wave velocity, with symbol v, and 


jaa 
iT 
or, from Eq. 11.1, 
v=dv | (11.2) 


This is a fundamental equation obeyed by all waves. 

Another important parameter that characterizes a wave is its amplitude. 
The amplitude of a wave is the maximum value of the displacement it produces 
(see Fig. 11-1). 


11.3 TRAVELING WAVES IN A STRING 


Let us have a very long string stretched along the x direction. A pulsed 
displacement, such as plucking the string, introduced at one end of the string, 
will cause a transverse displacement of the string in the y or z direction. Let 
us for now consider only the transverse y direction because it is more easily 
drawn on a flat sheet of paper. 

At t = 0, when the pulse is introduced, the string may look as in Fig. 
11-2a. The wave pulse travels along the string and, consequently, the string 
will look differently some time later, see Fig. 11-2b. This clearly indicates that 
the transverse displacement of the string, y, varies with x—that is, the point 
of the string under consideration—and with time t. In mathematical terms 
we say that the wave pulse, y, is a function of x and t; that is, y = f(x,t). The 
exact shape of the wave pulse, the precise form of the function f(x,t), will be 
determined by the source producing the pulse and the nature of the string. 
One of the most important and most commonly found types of traveling 
waves is the sinusoidal traveling wave, a wave consisting of a series of con- 
secutive sinusoidal pulses. 

Let us attach the end of the string (x = 0) to-a block connected to a spring 
hanging from the ceiling, as shown in Fig. 11-3. If we pull on the block, thus 
stretching the spring, and then release it, we know that the block will begin 
to oscillate. The y coordinate of the block, and therefore the transverse dis- 
placement of that end of the string, will be given by Eq. 10.9 


y(x = 0,t) = Asin (wt + ¢) (11.3) 


This transverse displacement, introduced at x = 0, moves along the string 
and, as aresult, sometime later other points on the string will begin to oscillate 
in the transverse y direction. If the velocity of the wave in the string is v, 
then the time it takes to travel a distance x along the string is x/v. Thus, at 
time t the displacement produced by the wave at a point x is the same as was 
the displacement at the origin (x = 0) at an earlier time t — fp, where fo is 


(er) 

FIGURE 11-2 

A transverse displacement in a 
string traveling in the positive x 
direction. 
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FIGURE 11-3 

Arrangement tor the production 

of sinusoidal traveling waves in 

a long string under tension. 
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the time that it takes the wave to reach x; that is, tp = x/v. Putting t — to into 
Eq. 11.3 for t, we obtain 
y(x,t) = Asin [a(t — fo) + 6] 


and on substituting tp = x/v we have 


y(x,t) = Asin ( _ me + 6) (11.4) 


Eq. 11.4 is the general form of the wave equation, but for our purposes 
we do not require such generality. If we limit ourselves to waves such that 
y = 0 when both x = 0 and t = 0, then @ = O or aw. We can then write 
the commonly used versions of Eq 11.4 as 


y(x,t) = Asin (wt — kx) when ¢ = 0 (11.5) 
or 
y(x,t) = Asin (kx — wt) when ¢ = 7 (11.5’) 
where 
@ 
k = 5 (11.6) 


Either of these versions of the travelling wave equation may be used, and 
the choice is a writer's preference. 

The constant k that we have introduced is called the propagation constant 
(or wave number). (Note that this k is a new and different constant from the 
spring constant k introduced in Chapter 10.) It will be used extensively in the 
latter part of this book. Its physical significance will soon become apparent. 

We can write an alternative representation of a sinusoidal traveling wave 
that differs from that of Eq. 11.5 in the direction of propagation of the wave 
that it represents. Equation 11.5 was derived by assuming that the wave 
traveled toward the right in the positive x direction. For a wave traveling 
toward the left, 


y(x,t) = Asin (kx + of) (11.7) 


To show that Eq. 11.7 represents a wave traveling in the negative x direction, 
we look at a particular value of y, the wave displacement, and ask ourselves, 
as time t increases, what happens to x for that particular y value of the wave? 
To pick a particular, fixed value of y, the argument of the sine function in 
Eq. 11.7 must be kept constant, that is, 


kx + wt = constant (11.8) 


Obviously, as t increases, x must decrease if the left side of Eq. 11.8 is to 
remain constant. Therefore, the wave of Eq. 11.7 is moving toward decreasing 
values of x, that is, in the negative x direction. 

To understand a traveling wave in a string produced by an oscillating 
source such as that of Fig. 11-3, let us examine some stop-action diagrams, 


y(x,f) = A sin (kx 


wt) 
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(b) (c) (d) (e) (f) 


(hr) (i) 

FIGURE 11-4 

Stop-action diagrams showing the introduction of a traveling wave in a string by the 
experimental arrangement of Figure 11-3. 


such as those in Fig. 11-4. We see that the y displacement introduced by the 
oscillation travels to the right in the detailed sketches of Fig. 11-4a-i. More 
than one full wavelength is represented in Fig. 11-4¢-1. 

To gain insight into the physical significance of the wave represented by 
either Eq. 11.5’ or Eq. 11.7, let us analyze it from two different points of view. 

Suppose we take a snapshot of the string as the wave travels through it. 
What will we see? Taking a snapshot of the string means setting t equal to 
an instantaneous value ft; in Eq. 11.5’, which now becomes 


y(x,t1) = A sin (kx — 6,) (11.9) 


where 6, = ot, is a phase shift at t;. This is shown best in Fig. 11-5 in comparison 
with a sine function plotted as a dashed line. In this figure the solid line is 
the snapshot at time ft, and the dashed line is a sine curve with y = 0 at the 
origin. At the time of the photograph the solid line is phase shifted by an 
angle 0, = wt, from the sine curve of y = 0 at x = 0. At some other time f, 
the snapshot would show a different phase shift 6.. Even though there is a 
phase shift, the string looks like a sine wave. We can use Eq. 11.9 to find the 
wavelength A of the wave. In Section 11-2, we defined A as the separation 
between two risings in the body of water of Fig. 11-1. Similarly here, A is the 
separation between two successive maxima in the transverse displacement of 
the string; for example, from Fig. 11-5 


A= X%-— XxX 


We identify x, and x2 as two successive values of x for which the sine function 


FIGURE 11-5 
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in Eq. 11.9 equals +1, that is, 

kx; — 0, = = rad (11.10) 
and 


kx» = 0; 


€ i 2m) rad 11.11) 


Solving Eqs. 11.10 and 11.11 simultaneously, we obtain 


kx» a kx = 29 


Xo — xX = es 
2 k 
2 
A= a (11.12) 


We see, therefore, that the propagation constant k in Eq. 11.5’ yields, 
through the relation of Eq. 11.12, the wavelength of the wave. 

An alternative way of looking at Eq. 11.5’ is to consider a particular point 
in the string of Fig. 11-47 and to analyze the motion of that point as a function 
of time. We can place a little light bulb at that point in the string and follow 
the motion of the bulb. Suppose we choose the particular point in the string 
as x, in Fig. 11-41. This means we set x equal to a constant value x; in Eq. 
11.5’, which now becomes 


y(x,,t) = Asin (6; — at) (11.13) 


where 6; = kx, is a constant phase shift that depends on the point chosen 
in contrast to the previous analysis, which showed that the phase shift de- 
pended on the time of the snapshot. We immediately recognize Eq. 11.13 as 
being similar to Eq. 10.9, which described the oscillatory motion of the body 
attached to a spring. With the position x fixed, y will vary with sin wt and 
the little bulb will undergo simple harmonic motion with amplitude A and 
frequency 


@ 
aoe (11.14) 
It should be noted that Eq. 11.13, with a change in x and therefore with the 
corresponding phase shift 6;, describes the motion of any other point in the 
string. As the wave moves through the string, all the particles in the string 
oscillate with the same amplitude and frequency, although out of phase with 
one another, that is, with different phase shifts. 
We conclude this section with an alternative demonstration of the relation 
between frequency and wavelength, Eq. 11.2. Combining Eq. 11.14 and Eq. 
11.12, we have the product 
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By definition k = : (Eg. 11.6), therefore 


=v (11.2) 


el1elé 


which is the result found earlier. This result is valid for all waves whether in 
a medium such as a string, or water or air, or in a vacuum such as light 
waves. | 


Example 11-1 


A mass of 0.2 kg suspended from a spring of force constant 1000 N/m is 
attached to a long string as shown in Fig. 11-3. The mass is set into vertical 
oscillation, and the distance between successive crests of the waves in the 
string is measured to be 12 cm. What is the velocity of waves in the string? 


Solution We use Eq. 10.12 to find the frequency of oscillation 


w oo —— Ue 70.71 rad/sec 


0.2 kg 
@ _ 70.71 rad/sec 
= 25, 11.26 Hz 
The velocity of waves in a medium can be found with Eq. 11.2 
v = Av 
= 12 x 10°-*m X 11.26 sec” 
= 1.35 m/sec 


11.4 ENERGY TRANSFER OF A WAVE 


One of the most important aspects of wave motion is that it provides a 
mechanism for the transfer of energy. A particle in a string before the wave 
arrives has no mechanical energy. If a sinusoidal wave arrives at the location 
of the particle, the particle begins to execute simple harmonic motion, and it 
therefore acquires kinetic as well as potential energy. The wave has given energy 
to the particle because the wave carries energy with tt. 

In fact if we think carefully, wave motion is one of the two general 
mechanisms available to transport energy from one point to another. The 
other occurs when one or more particles move from one point to another and 
in so doing bring their kinetic energy with them. This kinetic energy can be 
transferred to other particles in the medium through which they propagate. The splashing of the water caused 
There are, however, two obvious differences between these two mechanisms; by the sound waves of a tuning 
one of them will be crucial in the development of quantum mechanics in a fork is a vivid illustration of the 
later chapter. transfer of energy by a wave. 


152 WAVE MOTION 


1. The first difference is that waves transfer energy without transfer of matter, 
unlike the motion of particles. 


2. The second is that the energy of a beam of particles is localized (it is where 
the particles are at a given instant). In a wave the energy is distributed 
over the entire space occupied at a given instant by the wave. (When the 
ripples in the pond move outward from their source, all the water in the 
region of the wave is displaced.) 


We will now calculate the rate—that is, energy per unit time—at which 
energy is transmitted by a wave in a string, noting that a similar calculation, 
leading to similar results, may be made for any other type of wave. Let us 
consider the sinusoidal traveling wave represented by Eq. 11.5’. 


y = Asin (kx — at) (11.5’) 


The rate of energy production, consumption, or transmission was defined 
in Eq. 5.15 as power P. We can obtain P by calculating the energy crossing 
a given point in a string in 1 sec, for example, point D in Fig. 11-6. This will 
be equal to the wave energy of the string particles in one wavelength mul- 
tiplied by the number of wavelengths passing point D in 1 sec, that is, by 
the frequency ». 


P = (energy per wavelength) x v (11.15) 


To find the energy in one wavelength we note, as shown in the previous 
section, that each particle in the string is oscillating with the same amplitude 
A. Because the total energy of an oscillating particle is proportional to the 
square of the amplitude of oscillation (see Eq. 10.26 et seq.), we conclude that 
all the particles in the vibrating string have the same energy. At any given 
time, the energy of a particular particle may be all kinetic or all potential or 
a mixture. In Fig. 11-6, the energy of particle C is all kinetic, because C is 
passing through the equilibrium point. On the other hand, the energy of 
particle B is all potential, because it is about to reverse the direction of its 
transverse motion and its velocity is zero. 

To obtain the kinetic energy of the particles in the string we need an 
expression for the transverse velocity v,. This can be obtained by differen- 
tiating y in Eq. 11.5’ with respect to time. Because x is also a variable in the 
expression for y, we will indicate its omission in the derivative by writing v, 
as a partial derivative with respect to t; that is, 


ay 
dy = = = Asin (kx ~ of 


Vy = —wAcos (kx — at) (11.16) 


We can now calculate the energy of particle C of mass Am. 


1 
E (for particle C) = 5 AM Vjmax (11.17) 


FIGURE 11-6 


But from Eq. 11.16 Vymax = —@A, because the maximum value of the cosine 
function is 1. Eq. 11.17 becomes 


E (for particle C) = : Am w*A? (11.18) 
Because the energy of all the particles is the same, we can obtain the energy 
in one wavelength by replacing Am in Eq. 11.18 with the mass contained in 
one wavelength. If the amplitude of the wave is small compared with the 
wavelength (a situation often satisfied), then the mass in one wavelength is 
uA, where p is the mass per unit length of the string. Therefore, | 


1 
Energy per wavelength = 5 paw A? (11.19) 


Substituting Eq. 11.19 for the energy per wavelength in Eq. 11.15, we obtain 


1 
P= 5 pave A? 


Or 


P = 202pvv7A? (11.20) 


where we have made use of the fact that Av = v and w = 277. 

Although Eq. 11.20 has been derived for a wave ina string, two important 
features hold for any other type of wave: The power transported by a wave ts 
proportional to the square of the amplitude and to the velocity of propagation of 
the wave. We will use these important results later in our discussions of the 
principles of modern physics. 

When considering waves that propagate in three dimensions, such as 
sound waves or light waves, it is convenient to talk about the energy flowing 
through a given area of the medium traversed by the wave. The unique term 
intensity, with symbol I, is used for this purpose. The intensity is defined as 
the power transmitted per unit area perpendicular to the direction of prop- 
agation of the wave. Clearly, intensity and power are related by a simple 
geometric factor. Thus, the intensity of the wave is also proportional to the 
square of the amplitude. In the SI system, intensity has units of W/m’. 
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| ~ (Amplitude of wave) 


11.1 The speed of sound in air is about 330 m/sec, 
whereas the velocity of light is 3 x 10° m/sec. If you 
see a flash of lightning and count 8 sec before you 
hear the thunder, how far away was the lightning? 

(Answer: 2.64 x 10° m.) 


11.2 If the principal audio frequency of a thunder- 
clap is the lowest the ear can hear, about 20 Hz, 
what is the wavelength of the sound wave? 


11.3. The speed of all electromagnetic waves in air, 
both visible and invisible, is 3 x 10° m/sec. The 
AM radio band ranges in frequency from 550 to 
1600 kHz. What is the range of wavelengths? The 
FM band ranges from 88 to 108 MHz (1 MHz = 
10° Hz). What is its range of wavelengths? 


11.4 The range of sound frequencies detectable by 
the human ear is 20 to 20,000 Hz. What is the range 
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of wavelengths? The velocity of sound in air is 330 
m/sec. 


11.5 A rule of thumb for finding the distance where 
a flash of lightning occurs is to count the number of 
seconds from the moment one sees the lightning to 
the moment one hears the thunder. The distance in 
kilometers is the number of seconds divided by 3. 
How accurate is this rule? (See problem 11-1.) 


11.6 In an experiment designed to measure the ve- 
locity of sound waves in copper, a blow is struck at 
one end of a copper rod. Detectors at the other end 
measure the time interval between the arrival of the 
sound pulse through the rod and the arrival of the 
sound pulse through the air. If the rod is 3 m long 
and the sound pulse that traveled through the rod 
arrives 8.01 X 10~° sec earlier than the sound pulse 
that traveled through the air, what is the velocity of 
sound in copper? The velocity of sound in air is 330 
m/sec. 

(Answer: 2775 m/sec.) 


11.7 In the wave configuration shown in Fig. 11-7 
the length of the string is 4.2 m, the frequency of 
the wave is 1.2 Hz, and the amplitude is 0.05 m. 
What is the speed of the wave? Note that A >> 
amplitude. 

(Answer: 2.52 m/sec.) 


FIGURE 11-7 i 


Problem 11.7. ~ 4.2 1 a 


11.8 The equation of a transverse wave traveling 
along a very long string is given by 


y = 6.0 sin (0.0207 x + 4.07 #) 


where x and y are expressed in centimeters and f in 
seconds. Find (a) the amplitude, (b) the wavelength, 
(c) the frequency, (d) the speed of propagation, (e) 
the direction of propagation of the wave, and (f) the 
maximum transverse speed of a particle in the string. 


11.10 The equation of a traveling wave in a long 
stretched string is y = 107 sin (32t — 4x)m, where 
x is in meters and t isin seconds. What is the velocity 
of the wave in the string? 


Write the equation of motion of a traveling 


wave for the string in problem 11-7 using the same 
amplitude. Assume that the wave travels in the pos- 
itive x direction. 


11.11 Write the equation for a wave traveling in 
the negative direction along the x axis and having 
an amplitude of 0.010 m, a frequency 550 Hz, and 
a speed 330 m/sec. 

(Answer: y = 0.010 sin (3.337 x + 11007 f) m.) 


11.12 Consider the situation illustrated in Fig. 11-3. 
Let the spring constant k = 200 N/m and the mass 
of the block m = 2 kg. The block is given an upward 
initial kick to start it oscillating. Sketch the shape of 
the string at t = 0.15 sec, t = 0.3 sec, f = 0.45 sec, 
t = 0.6 sec, andt = 1.2 sec. 


11.13 A triangular pulse of height 0.5 m and length 
2 m moves along the positive x direction on a string 
with velocity 12 m/sec (see Fig. 11-8). At tf = 0 the 
pulse is between x; = 1 mand x, = 3m. Plot the 
transverse velocity of point x2 as a function of time. 


FIGURE 11-8 ox ; 
Problem 11.13. in) 

11.14 A sinusoidal traveling wave on a string has 
a frequency v = 15 Hz anda velocity v = 7.5 m/sec. 
(a) How far apart are two points whose transverse 
displacements are phase-shifted by 30°? (b) At a par- 
ticular point on the string, what is the phase differ- 
ence between two displacements occurring 0.05 sec 
apart? 

(Answer: (a) 4.17 X 107? m, (b) 270°.) 


11.15 A block connected to a rigid rod is raised 
from some initial position y = 0 with constant ve- 
locity v, = 20 m/sec for 0.4 sec. The block is then 
suddenly lowered to its initial position and then raised 
again with the same velocity for the same amount 
of time. The cycle is repeated indefinitely. A long 
string under tension is attached to the side of the 
block (see Fig. 11-9). Let the wave velocity in the 
string be 5 m/sec. (a) Sketch the shape of the string, 
using approximate dimensions for the x and y co- 


ordinates, at t = 0.2 sec, t = 0.4 sec and ?t = 1.2 
sec. (b) What is the spacing of the pulses on the 
string? 


FIGURE 11-9 
Problem 11.15. 


—_————l 


11.16 A laser produces light pulses of energy 5 J 
and duration 2 x 10~° sec. The width of the beam 
is 1 mm?. What is the intensity of the laser light? 


11.17 A sinusoidal traveling wave of amplitude 
2 cm and wavelength 50 cm moves along a string 
with velocity v = 6 m/sec. (a) What is the maximum 
transverse velocity of the particles in the string? (b) 
What is the maximum transverse acceleration of the 
particles in the string? 

(Answer: (a) 1.51 m/sec, (b) 113.7 m/sec?.) 


11.18 If the string of problem 11.17 is 30 m long 
and has a mass of 6 kg, what is the power trans- 
mitted by the wave? 


11.19 Suppose that during the transmission of waves 
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through a string the frequency is suddenly doubled 
while maintaining the same amplitude and velocity 
of propagation, what will happen to the magnitude 
of the power transmitted? Suppose that the ampli- 
tude is doubled while holding the frequency fixed, 
what will happen to the power? 


11.20 Consider a point source emitting waves in 
all directions. If the medium through which the waves 
propagate is isotropic, the velocity of propagation 
will be the same in all directions. As a result, points 
in the medium where the wave has a certain phase 
are equidistant from the source and therefore lie on 
a spherical shell with the source at the center. These 
waves are called spherical waves. Consider waves 
emitted from a 5-W source in a nonabsorbing me- 
dium. (a) What is the intensity of the waves 1 m 
away from the source? (b) What should the power 
of the source be in order that the intensity of the 
waves 1 m away be the same as that of the laser in 
problem 11.16? 


11.21 A point source emits spherical waves in a 
nonabsorbing medium (see problem 11.20.) The in- 
tensity at some unknown distance from the source 
is 25 W/m’. The intensity at some point 10 m farther 
away from the source is 16 W/m”. (a) How far is the 
source from the first point? (b) What is the power 
output of the source? 

(Answer: (a) 40 m, (b) 503 kW.) 
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12.1 INTRODUCTION 


In the preceding chapter we introduced the concept of waves as a periodic 
disturbance of a medium. We used familiar concepts such as waves on a string 
or in water to illustrate the phenomena. The general equation for a traveling 
wave was derived as was the concept of phase shift. In this chapter we will 
start with the behavior of two waves when they come together and the effect 
produced by their relative phase. We will first discuss this phenomenon with 
waves in water and then extend it to light waves. At this point we will assume, 
as did early investigators, that air could be the substance in which light-wave 
motion occurs. However, we will show in a later chapter that light waves do 
not require some substance or medium to support them. 


12.2 THE SUPERPOSITION PRINCIPLE 


One of the fundamental principles governing the propagation of waves is 
called the superposition principle. What happens when two different waves 
meet? Experiments show that waves can move through the same region of 
space independently and, as a result, when they meet the resultant wave is 
simply the algebraic sum of the individual waves. (The superposition principle 
does not hold for waves of very large amplitude in deformable media.) Figure 
12-1 shows what happens when a square pulse in the string meets a triangular 
pulse moving in the opposite direction. In Fig. 12-la two positive waves 
approach each other. The resulting displacement of the string is the addition 
of the displacements that each pulse would have produced in the absence of 
the other (Fig. 12-1b). After meeting, the waves move on unaffected (Fig. 
12-1c). If one wave pulse is positive and the other is negative, as in Fig. 
12-1d-f, the negatively directed pulse subtracts from the positive pulse. These 
two examples show that the resultant pulse while the two waves are passing 
one another is the algebraic sum of the two waves. 

The superposition principle leads to a wave phenomenon known as in- 
terference. Suppose two waves with the same wavelength, velocity, and am- 
plitude, but from different sources, travel together in the same direction. 
What will be the amplitude of the resulting wave? Figure 12-2a shows that if 
they are in phase, the total amplitude at any point will be the simple sum of 
the two. If they are out of phase by one-half wavelength, the resulting am- 
plitude will be zero (see Fig. 12-2b). The first case is called constructive inter- 
ference, and the second is called destructive interference. It is important to note 
that if either wave is shifted to the right or to the left by a whole wavelength, 
the situation is unchanged. However, if the shift is by only a half wavelength 
the situation is reversed; that is, constructive interference becomes destruc- 
tive and vice versa. 


(f) Ls 


FIGURE 12-1 

Superposition principle: (a) A 
square positive pulse and a tri- 
angular positive pulse in a string 
traveling in opposite directions. 
(b) The two pulses meet, and the 
resulting displacement of the 
string is the sum of the displace- 
ments that each pulse would have 
produced in the absence of the 
other. (c) After meeting, the 
pulses move away unaffected. (d) 
A positive square pulse and a 
negative triangular pulse mov- 
ing in opposite directions. (e) The 
pulses meet, the resulting dis- 
placement is the difference (al- 
gebraic sum) of the two pulses. 
(f) After meeting, the two pulses 
move away unaffected. 
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12.3 INTERFERENCE FROM TWO SOURCES 


If pebbles or water drops fall at regular intervals in still water, a pattern of 
circular waves, each constantly increasing in radius, will be established. If a 
similar situation with the same frequency of disturbance occurs nearby, the 
circular traveling waves will cross one another, producing an interference 
pattern. Note that the restriction of the previous section that the waves travel 
together in the same direction has now been removed. At some points the 
interference will be constructive, and at others it will be destructive. A simple 
laboratory demonstration of this is shown in Fig. 12-3. This is a photograph 
of what is called a ripple tank. This is a tray of water illuminated from below. 


FIGURE 12-3 

Ripple-tank demonstration of the phenomenon of interference. Two vibrators striking 
the water surface in the tank at periodic intervals produce two circular wave patterns. 
As the two wave patterns cross each other, an interference pattern results. Along the 
solid lines the waves from the two sources interfere constructively, that is, the dis- 
placement of the water is large. Along the dashed lines the interference is destructive, 
and the displacement of the water is zero. There are additional paths of constructive 
and destructive interference that have not been marked with lines. 
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FIGURE 12-2 

Interference of two waves: (a) 
Constructive interference of two 
waves traveling together in 
phase, that is, with the ampli- 
tudes coinciding, resulting in a 
wave with an amplitude that is 
the sum of the amplitudes of the 
individual waves. (b) Destructive 
interference of two waves of equal 
amplitude traveling together with 
a phase difference of one-half 
wavelength resulting in a wave 
of zero amplitude. 
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Instead of having drops of water falling, two vibrators are placed in the tray 
and the frequency and amplitude of vibration can be controlled more accu- 
rately than can that of falling drops of water. The centers of the circles at the 
bottom are the locations at which the vibrators are located. Some of the paths 
along which constructive interference occurs are indicated with solid lines. 
The dashed lines between the paths labeled 0 and 1 are regions of destructive 
interference. 

Figure 12-4 is a schematic representation of the photograph in Fig. 12-3 
in which the wave crests, represented by the circular lines, proceed outward 
from the two sources S, and 53. It is seen in this figure that the phenomenon 
of Fig. 12-3 is a purely geometric one that can be reproduced on paper with 
a compass and ruler. The distance between the crests is the wavelength A. 
The troughs are located halfway between the crests. As in the photograph, 
the solid lines labeled 0, 1, 2, . . . represent the paths of constructive inter- 
ference. 

We notice in this figure that the paths of constructive interference are 
symmetric about the line labeled 0. Therefore, we need to consider only one 
group either above or below the 0 line. We will consider the ones above, 
knowing that the results will be the same for the ones below. Constructive 
interference occurs along these paths because the crests from the two sources 
coincide and add to the disturbance of the water. This is the criterion presented 
in the previous section for constructive interference. Along the path labeled 
0 in Fig. 12-4, the first crest from 5, coincides with the first crest from 52, the 
second crest from S, with the second from 52, and so on. Along the path 
labeled 1, the first crest from S, coincides with the second crest from S,, the 
second from 5, with the third from 5,;, and so on. We may view these waves 
as having traveled for some distance from their sources along their respective 
paths. When two waves travel in the same medium, the difference in the 
distances traveled by them from their respective sources to a common point 
is called the path difference. Keeping in mind that the separation between 
successive crests is the wavelength A, we can now state the criterion for 
constructive interference as follows: When waves from two sources are emit- 
ted in phase, constructive interference occurs when the path difference is zero, or 
one wavelength, or an integral multiple of wavelengths nv. This can be formally 
shown with the trigonometric relation for the sum of sines, 


1 1 
sina + sin b = 2 sin 5 (a + b) cos; (a — Bb) 


: (12.1) 


Let us consider a point P whose distances from S, and S, are x, and x, 
respectively, as shown in Fig. 12-5. From Eq. 11.5’, the wave y, from S, and 
the wave y> from S, at P are 


y, = Asin (kx, — at) 
Y2 = Asin (kx. — at) 


FIGURE 12-4 

Geometric representation of the 
photograph in Fig. 12-3. The 
crests of the water waves are rep- 
resented by circular lines whose 
centers coincide with the loca- 
tion of the vibrators. The dis- 
tance between adjacent crests is 
the wavelength. The troughs are 
halfway between the circular 
lines. The crests from the two 
sources coincide along the solid 
lines labeled 0, 1, 2, where con- 
structive interference occurs. 
Along the dashed lines, the crests 
from one source coincide with the 
troughs from the other, resulting 
in paths of destructive interfer- 
ence. 


FIGURE 12-5 
Arbitrary point P in Fig. 12-3 or 
12-4. 
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From the superposition principle, the resulting wave will be 


yy Ye 
= A [sin (kx, — ot) + sin (kx. — ot)] (12.2) 
Now let the path difference be an integral multiple of the wavelength, that 
iS, 
Xo — xX, = na, where n = 0, £1, +2,... 
and 


Xo =X, + na 


or, because from Eq. 11.12 


27 
a 
X2 = xX, + al 
=o 
then Eq. 12.2 becomes 
y = A [sin (kx, — wt) + sin (kx, + 27 — aot)| (12.3) 


Using Eq. 12.1 gives 


1 1 
y = 2A sin 5 (kx, — wt + kx, — wt + 27m) cos 5 (277) 


y = 2A sin (kx; — af) (12.4) 
In the last step we let kx; — wt = @ and used the fact that sin (@ + nm) = 
—sin 6 and cos naz = —1 if n is an odd integer, and sin (0 + nz) = sin 6 


and cos na = 1 if n is an even integer. Equation 12.4 formally verifies that 
when the path difference is an integral multiple of the wavelength, the re- 
sulting wave has an amplitude that is twice that of y; or y2 at that point. 

Returning to Fig. 12-4, we note that the dashed lines, representing paths 
where destructive interference occurs, correspond to points whose distances 
from S, and S, differ by 1/2 A, or (from Eq. 11.12) x. — x; = A/2 = alk. The 
resulting wave in this case will be 

y= YT Ye 
= Alsin (kx, — wt) + sin (kx, + 7 — abt)| 


Using Eq. 12.1, we obtain 


1 1 
y = 2A sin 5 (kx — wt + kx, -— wt + 77) COS 5 7 


But cos 7/2 = 0, and therefore y = y; + y2 = 0; this by definition is destructive 


; ; . 3 5 
interference. The same result is obtained if x2 — x; = 5 A or 5 A and so on. 
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12.4 DOUBLE SLIT INTERFERENCE OF LIGHT 


If a series of either plane waves or large radius spherical waves strike a barrier 
with a small opening, circular waves are propagated beyond the opening as 
if the opening were a point source. The enlarging circumference of these 
waves is called a wave front. Figure 12-6 illustrates this propagation with water 


Christian Huygens (1627-1675) 


FIGURE 12-7 
(Geometric representation ot the 


phenomenon in Fig. 12-6. The 
opening in the barrier produces 
a circular wave front. Each point 
in that wave front produces sec- 
ondary circular wavelets, and the 
FIGURE 12-6 new propagated wave front is the 
Huygens’ principle. Parallel wave fronts in the ripple tank strike a barrier witha small tangent to these secondary wave- 
opening. The opening becomes a source of circular waves. lets. 
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waves in a still tank. This phenomenon illustrates what is known as Huygens’ 
principle after the Dutch physicist, Christian Huygens (1629-1695). It is more 
generally stated that every point on a wave front can be considered as a source 
of secondary wavelets that spread out in all directions with a speed and 
wavelength equal to those of the propagating wave. The newly propagated 
wave front is the tangent to these secondary wavelets. Thus, the spherical 
wavelet passing through the opening in the barrier of Fig. 12-6 produces a 
secondary wave front that itself produces other wavelets as shown in Fig. 
12-7. 

Huygens demonstrated that the known facts about the propagation of 
light could be explained by using his principle. It was many years, however, 
before light was accepted as a wave phenomenon. This came about when the 
English physician Thomas Young (1773-1829) performed the first successful 
experiment that exhibited the interference of light in 1801. The nature of light 
is discussed in more detail in Chapter 16. For now, we mention that visible 
light has a wavelength that ranges from about 4 x 10°’ mto7 x 10°77 m, 
where the lower values appear to us as violet and the higher values as red. 
A unit of length often used in specifying the wavelength of light is the Angs- 
trom, with symbol A;1A = 107-8 cm = 107?!°m, and therefore the wavelength 
of visible light ranges from about 4000 A to 7000 A. 

Figure 12-82 illustrates a schematic arrangement, similar to the one used 
by Young, to determine the phenomenon of interference with light. A mono- 
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(a) Projecting Screen (b) 

FIGURE 12-8 

(a) Schematic of Young’s double slit interference of light experiment. Light passing 
through two small slits 5, and 5S, in an opaque screen produce an interference pattern 
on a projecting screen to the right of the slits. (b) Photograph of the interference 
pattern on the projecting screen showing a pattern of alternating bright and dark 
fringes. (Source: Cagnet et al., Atlas of Optical Phenomena, Springer-Verlag, New York, 
1971.) 
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FIGURE 12-9 
(a) Geometric construct of lines from each of the slits in Fig. 12-8 to a point P on the 


projecting screen where constructive interference is observed. (b) When D >> d, the 
two lines are approximately parallel. The difference between the two paths traveled 
by the light from S; and S, is 6 = na. 


chromatic light source (a single wavelength) shines on an opaque screen 
with two narrow slits S; and 5. According to Huygens’ principle, these two 
slits become point sources of light. If we let the light that passes through the 
slits fall on a screen, we will observe a pattern of bright and dark lines that 
indicates constructive and destructive interference. A plot of the light intensity 
on the screen is schematically represented in Fig. 12-8a. Figure 12-8) is a 
photograph of the interference pattern observed on the screen. 

We can use the principles developed in the previous section to find 
expressions for the position of the interference maxima and minima. Figure 
12-9a shows a geometric construct of lines drawn from each of the two slits 
to a point P of constructive interference. As will become evident soon, for 
the interference pattern to be easily observable, the separation between the 
slits, d, cannot be much greater than the wavelength. This implies that in the 
case of light d might be a few microns (10~° m) whereas the distance, D, 
between the slits and the screen could be several centimeters, that is, D >> d. 
We therefore conclude that the two lines $,;,P and S,P are almost parallel. 
The situation of Fig. 12-94 can thus be approximated by that shown in Fig. 
12-9b. In Fig. 12-9b, a perpendicular has been dropped from slit S, to point 
Q on the line S,;P. Angles indicated by 6 are equal because their sides are 
mutually perpendicular. The extra distance S,Q traveled by the wave from 
slit S, is the path difference, with symbol 6, between the two waves when 
they arrive at P. It is seen from triangle S,5,Q that 


5 = dsiné (12.5) 


As we have shown in the preceding section, constructive interference at point 
P can occur only if this path difference is an integral multiple of wavelengths 
nv. The condition for constructive interference in this case becomes 


dsin@ = nA, wheren = 0,1,2,3,... (12.6) 


similarly, destructive interference will occur when the path difference is 
1/2 A, 3/2 A, 5/2 A, or in general [(2n + 1)/2] A, where n = 0, 1, 2,3,.... 


Condition for constructive 
interference 


d sin @ = nA 
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FIGURE 12-10 

Interference patterns in the ripple tank of Fig. 12-3. (a) 
and (b) If A is kept constant, the angular separation be- 
tween the interference maxima decreases when the sep- 
aration d between the sources is increased. (a) and (c) Both 
d and A are doubled, the angular separation between in- 
terference maxima remains the same. 
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The condition for destructive interference can be stated as 
2n+ 1 a : 
d sin 0 = A, wheren = 0,1,2,3,... (12.7) Condition for destructive 
interference 
We recall that as @ increases, sin 6 also increases. We conclude from Eq. 12.6 ae 2n + 1 
bud d sin @ = A 
that for a constant A, the angular separation between successive interference 9 


maxima decreases with increasing spacing between the slits, d. A simple 
laboratory demonstration of this is shown in Fig. 12-10. These are photographs 
of a ripple tank like the one used in the demonstration of Fig. 12-3. A com- 
parison of Fig. 12-10a and 12-10b shows that if A is kept constant, the angular 
separation between interference maxima decreases as the separation between 
the sources increases. Similarly a comparison of Fig. 12-10a and 12-10c indi- 
cates that when the separation between the sources is doubled, the wave- 
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length must also be doubled to have the same angular separation. It should 
also be clear from Eq. 12.6 that if d >> A, a large number of interference 
maxima will occur within a small angle and, as a result, the pattern will be 
difficult to observe. To illustrate the point, let us assume that d = 1000 J. If 
we consider a small angle 6, for example, 6 = 1°, we can solve for n in Eq. 
12.6, and we will get 


1000 A sin 1° = na 
therefore 
n = 1000 sin 1° = 1000 X 0.017 = 17 


Seventeen intensity maxima will be formed within an angle of 1°. It is clear 
that unless the distance from the slits to the screen is very large, the inter- 
ference pattern will not be observable. 

Although we will not derive the relation, it should be mentioned at this 
point that a derivation similar to that of the two-slits can be made for multiple 
slits of the same width and the same separation distance: Eqs. 12.6 and 12.7 
also apply for multiple slits. An opaque piece of glass with multiple slits is 
called a diffraction grating. 


12.5 SINGLE SLIT DIFFRACTION 


In the preceding discussion of double slit interference of light, we assumed 
that the two openings in the opaque screen acted as point sources. We will 
see that this assumption is correct if the size of the openings is smaller than 
the wavelength. If the opening is greater than the wavelength but of a size 
comparable to a few wavelengths, then light waves passing through different 
portions of a single slit will interfere with each other giving rise to a phe- 
nomenon known as single slit diffraction. 

The method used in Section 12.4 can be employed to analyze the dif- 
fraction of light by a single slit. Figure 12.11 illustrates schematically the 
passage of individual wavelets through a single slit of width d. The pattern 
seen on a screen to the right of the slit appears with a central bright maximum 
with alternating bright and dark fringes on either side, with the bright fringes 
diminishing in intensity as the angle from the normal increases. This is il- 
lustrated in Figs. 12-12a and b. 

In our treatment, we will assume that the size of the slit is much smaller 
than the distance from the slit to the screen. Under this condition, the lines 
of propagation of the waves emanating from different points in the slit are 
approximately parallel. In Figs. 12-11a and 12-122 all forward-directed waves 
strike the screen in phase, because the path difference is zero. These waves 
give rise to the central maximum. In Fig. 12-11b, wave A has a path difference 
of 4/2 from wave B and interferes destructively. This destructive interference 
effect occurs across the entire slit because any wave slightly below A will 
interfere destructively with the wave at the same distance below B and so 
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Photograph illustrating the dif- 
fraction of light by the fine edges 
of a razor blade. 
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(a) Bright (b) Dark 

FIGURE 12-11 
Single slit diffraction. As light passes through the slit, each point in the slit becomes 
a secondary source of light. (a) All forward-directed waves arrive at the screen in 
phase, giving rise to the central maximum of Fig. 12-12. (b) Waves from the upper 
half of the slit interfere destructively with those from the lower half, giving rise the 
first diffraction minimum in Fig. 12-12. (c) Waves between A and B interfere destruc- 
tively with waves between B and C, those between C and D also interfere destructively 
with those between D and E. This results in the second interference minimum in Fig. 
12-12. 


(c) Dark 


on. Therefore, a dark band will appear on the screen for angle 6; where 


sin 0, = : (12.8) 

d 
At angle 63 in Fig. 12-11c, the situation is similar to that of Fig. 12-11b in that 
all waves between A and B will destructively interfere with corresponding 
waves between B and C and all the waves between C and D interfere de- 
structively with those between D and E. Thus, a dark band is observed for 
the condition sin 0; = 2A/d. Somewhere in between these two minima a 


intensity 


(a) 
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FIGURE 12-12 

(a) Schematic representation of 
the interference pattern produced 
on a screen by light passing 
through a slit (b) Photograph of 
the interference pattern ob- 
served on the screen. (Source: 
Cagnet et al., Atlas of Optical Phe- 
nomena. New York, Springer- 
Verlag, 1971.) 
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FIGURE 12-13 

(a) Fringe pattern obtained with light shined through a double slit when the size of 
the slits is greater than but of the same order of magnitude as the wavelength. The 
pattern is essentially a double slit interference pattern, similar to that of figure 12-8), 
“modulated” by the diffraction pattern of Fig. 12-12. (b) Single slit diffraction pattern 
obtained when one of the slits is blocked. 


(0) 


maximum will be observed. We can generalize this result by stating that the 
diffraction minima occur for angles satisfying the relation 


sin 0 = ne, where n = 1, 2,3,... (12.9) 
From Eq. 12.8, we can see that if d = A, then sin 6, = 1 and 6, = 90°, which 
means that the central maximum spreads over the entire screen. The same 
will be true if d < A. This is what occurs when the screen is illuminated with 
a point source. We see, therefore, that an opening can be considered a point 
source when its size is equal to or smaller than the wavelength. Another 
important conclusion can be drawn from Eq. 12.9. This equation, which gives 
the position of the minima of single slit diffraction, is identical to Eq. 12.6, 
which gives the angular position of the interference maxima for the double 
slit case. We found then that if the separation between the slits, d, was much 
greater than the wavelength, the interference pattern would be difficult to 
observe. The same conclusion holds here if the size of the slit is much greater 
than A. 
The patterns of light on a screen from single and double slits are quite 
distinctive, as shown in Fig. 12-13. 


12.6 RESOLVING POWER 


We have seen in the preceding section that light effectively bends around 
corners. That is, when light shines on a slit the edges of the slit are not simply 
shadowed on the screen; there are also small bright lines within the shadow 
at angles away from the normal. This has profound implications for our 
determination of the location of a particle. We will show here that because 


(a) 

FIGURE 12-14 
Two light sources shining on a single slit give rise to two individual diffraction patterns. 
(a) The sources are sufficiently apart and the two central maxima are well resolved. 
(b) The sources are too close together and as a result the two sources cannot be resolved. 
The two central maxima overlap to the extent that a single intensity maximum (dashed 
line) appears on the screen. 


(0) 


of diffraction effects the accuracy of the determination of the precise location 
of a particle depends on the wavelength used to “‘look”’ at it; the smaller the 
wavelength employed, the greater the accuracy. In Chapter 19 we will show 
that the smaller the wavelength of the light used, the greater the momentum 
transferred to the particle being examined and, correspondingly, the greater 
the disturbance of its position in space. Thus, because of these conflicting 
effects there is a limit to which we may simultaneously know the location 
and momentum of a particle. This limit is known as the Uncertainty Principle. 
For now, we will show that the determination of location depends on the 
wavelength; this is called the resolving power. 

Suppose we have two sources of light, 5; and 52, shining on a single slit. 
If they are sufficiently far apart we will see on a screen two single slit patterns, 
each of the type of Fig. 12-12. This is illustrated in Fig. 12-14a. The light 
intensity at any point on the screen will be the sum of the contributions from 
each source. If we know the distance of the sources from the slit and the 
distance of the screen from the slit, we can calculate the angle A@ and the 
distance Ax between the two sources from the separation of the two central 
maxima. Suppose that the sources are closer together than in Fig. 12-14a and 
we have the arrangement of Fig. 12-14b. We see that the bright central maxima 
of the diffraction pattern from the two sources overlap so that, because it is 
the sum that is seen, it is difficult to estimate their distance apart and therefore 
difficult to know Ax. A rather arbitrary, although practical, criterion used to 
decide when the two sources S; and 5S, are just resolvable (i.e., considered 
as separate sources) is the coincidence of the central maximum of one of them 
with the first minimum of the other (see Fig. 12-15). This is known as the 
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FIGURE 12-15 

Rayleigh criterion. The separa- 
| tion between the sources is such 
that the central maximum 
produced by one source coin- 
cides with the first diffraction 
minimum from the other. The two 
- sources are barely resolvable. 


Rayleigh criterion or the limit of resolution. We saw in Eq. 12.8 that the first 
minimum occurs when sin 6, = A/d. Because this occurs for small angles, 


sin 0; ~ 6, (in radians) and limiting angle of 


a r resolution 
A@ = 6, (in radians) = limiting angle of resolution ~ 7 (12.10) 
A 
AQ == 
Thus, for high resolution—that is, small Ax (see Fig. 12-15)—one should have d 
a small A and a large slit width d. 


12.7 X-RAY DIFFRACTION BY 
CRYSTALS: BRAGG SCATTERING 


Waves are scattered or reflected by objects their own size or larger. A wave 
at the seashore will not be affected by a stick in the water but will be by a 
large rock or jetty. Atoms have sizes of the order of a few angstroms, 
1to3 x 10°’ m. Visible light has wavelengths of a few thousand angstroms. 
Therefore, visible light will not be affected by a single atom. Light is only a 
small part of the wavelength range of what is called the electromagnetic spectrum 
(Chapter 16). This spectrum ranges between y-rays and radio waves. The 
smallest wavelengths that we can conveniently produce are those of X rays, 
whose wavelengths are about the sizes of atoms. These are produced by 
bombarding a metal target with high-energy electrons. Their origin will be 
discussed in Chapter 17. 

In 1912, Max von Laue noted that in a crystalline solid the interatomic 
separation is of the same order of magnitude as the wavelength of X rays. 
He then showed that the regular, periodic arrangement of atoms in a crys- 
talline solid could be used as a three-dimensional diffraction grating (a diffrac- 
tion grating was defined at the end of Section 12.4). One year later, Sir William — oe 
Bragg presented a similar but simpler analysis of the problem. We will now _ Sir William Henry Bragg (1862 
present an outline of Brage’s analysis. 1942). 


X-RAY DIFFRACTION BY CRYSTALS: BRAGG SCATTERING 


Figure 12-16 is a planar representation of a three-dimensional cubic crys- 
tal, that is, a solid in which the atoms are located at the corners of unit cubes. 
The interatomic separation is d. When X rays strike the crystal at an angle 6 
with respect to a plane of these atoms, the atoms will scatter them in 
all directions. We will first limit ourselves to the X rays scattered specu- 
larly—that is, X rays for which the angle between the scattered beam and the 
plane of the crystal is the same as the one between the incident beam and 
the crystal plane, as shown at points A and B in Fig. 12-16. It is clear from 
the figure that the path difference between the X rays scattered by atom A 
and atom B is 2] because the ray scattered from atom B must travel that extra 
distance to rejoin the ray scattered from atom A. 

From geometric considerations, | = d sin 6, and the path difference is 
21 = 2d sin 6. By analogy to other interference experiments discussed in 
previous sections, if this path difference is equal to an integral number of 
wavelengths, the two beams will add constructively; that is, the radiation 
reflected by two adjacent layers of atoms will add constructively if 


2d sin 90 = nA wheren = 1,2,3,... (12.11) 


Equation 12.11 is known as the Bragg condition. Obviously, if the waves re- 
flected by the first layer are in phase with those reflected by the second layer, 
the same will be true for the waves reflected by the second and third, and 
so on. Thus the condition of Eq. 12.11 guarantees that the radiation reflected 
by all the atoms in parallel layers of the crystal at the same distance apart 
will be in phase. 

Thus far we have concentrated our attention on the waves that were 
scattered specularly. Is it possible to have an angle 6’, not necessarily equal 
to the angle of incidence 6, for which the scattered waves recombine con- 
structively? The answer is yes. However, if such an angle exists, it can be 
shown that a set of atomic planes exists, different from the ones that we have 
considered, such that with respect to them, the angle of incidence and the 
angle of scattering are equal. Furthermore, with respect to this new set of 
atomic planes, the Bragg condition (Eq. 12.11) is satisfied. This situation is 
illustrated in Fig. 12.17. To have constructive interference for the direction 
shown in Fig. 12-17, the condition 2d’ sin @ = nA must be satisfied. 

We should note that the Bragg equation is very similar to the double slit 
equation (Eq. 12.6). In the Bragg equation the interatomic spacing plays the 
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FIGURE 12-16 


Planar representation of a three- 
dimensional cubic crystal of in- 
teratomic spacing d. The X rays 
reflected by the dashed atomic 
planes will recombine construc- 
tively if the Bragg condition is 
satisfied. 


Bragg Condition 


2dasin @6= na 


FIGURE 12-17 
A different set of atomic planes 


(indicated by solid lines) in the 
crystal of Fig. 12-16 can produce 
constructive reflection if the Bragg 
condition with respect to those 
planes is satisfied. 
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same role as the separation between the slits in the double slit equation. 
Because the interatomic spacing is of the order of 10~'° m, we can see how 
a crystalline solid can be used as a diffraction grating for radiation of small 
wavelength, such as X rays or, as will be discussed in Chapter 19, matter 
waves. 


Example 12-1 


The crystal structure of silver bromide (AgBr) is represented in Fig. 12-18. 
The molecular weight and the density’ of AgBr are 187.80 g/mole and 
6.47 g/cm’, respectively. (a) Calculate the interatomic separation, d, of the 
atoms in AgBr. (b) If X rays of wavelength A = 1.50 A are incident on a AgBr 
crystal, at what angle will the first order (n = 1) diffraction maxima occur? 
Assume that the separation between the atomic planes producing the scat- 
tering is the interatomic spacing found in part (a). 


Solution 


(a) Let us consider a cube of AgBr 1 cm a side. In one row of the cube 
we have | cm/d (cm) atoms. Because we have as many rows of atoms 
in one plane as there are atoms in a row, we conclude that the number 
of atoms in one plane of the cube is I/d x 1/d or 1/d?. Finally, we 
have as many planes of atoms as we have atoms in a row; therefore 


1 cm? 


Number of atoms in 1 cm? = 7B 


where d is expressed in cm. 
The actual number of atoms can be found as follows: 


N of atoms/cm? = N of moles/cm? x N of atoms/mole 


6.47 g/cm? 


Se ee 
187.80 g/mole ™ 


where Na is Avogadro’s number, 6.02 x 10*° molecules/mole and 
the factor of 2 is included because there are two atoms (one Ag and 
one Br) per molecule. 
We can now write the following relation 
1 6.47 g/cm? 


— = ——— x 2 X 6.02 X 107 
7B 187.80 g/mole 6.0 0*? molecules/mole 


4.15 x 10% cm-* 


therefore 


d = 2.89 x 10-8 cm = 2.89 A 


‘Density is defined as mass per unit volume (see problem 13.1). 


FIGURE 12-18 
Example 12-1. 


Diffraction pattern produced 
when X rays are incident on a 
NaCl crystal. Each dot is 
produced by a set of atomic 
planes, satisfying the Bragg con- 
dition. 


(b) We can use the Bragg condition, Eq. 12.11, to find the angle at which 
the first-order (n = 1) diffraction maximum is observed. 


2d sin@d=A 
Sl pak 
in ~ 94 
— 150A 
2x 2.89 A 
= 0.26 


6 = sin” ' 0.26 = 15° 


12.8 STANDING WAVES 


Another interesting phenomenon resulting from the superposition principle 
is the formation of standing waves. 

In Chapter 11, when we discussed traveling waves in a string, we im- 
plicitly assumed that once the waves were set up at one end, they continue 
traveling toward the right forever. This is a correct assumption if the string 
is infinitely long. Consider now that the string is of finite length and the other 
end is clamped te a rigid support. When the wave disturbances reach the 
fixed end, they will propagate in the opposite direction. The reflected waves 
will add to the incident waves according to the superposition principle and, 
under certain conditions, a standing wave pattern will be formed. 

If we assume that the incident waves y; travel toward the right in the 


positive x direction, from Eq. 11.5 we can represent them as 
y, = Asin (kx — at) (11.5) 


The reflected waves y2 will be traveling in the negative x direction and from 
Eq. 11.7 are given by 


Y2 = Asin (kx + owt) (11.7) 
The resulting wave pattern will be 
y = yy; + yo = A [sin (kx — at) + sin (kx + at)] 
Using the trigonometric relation of Eq. 12.1, we obtain 
y = 2A sin kx cos wt (12.12) 


Equation 12.12 is the equation of a standing wave. We note that, as in the 
case of a traveling wave, the particles in the string execute simple harmonic 


motion with the frequency of the wave v = - . Unlike the case of the traveling 
7 


wave, however, the amplitude of oscillation is not the same for all the points 
(all values of x in Eq. 12.12) in the string. In particular, there are certain points 
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When this flute player blows on 


the mouthpiece standing waves 
are set up inside the flute. 
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for which the amplitude of oscillation (the coefficient of cos wt) will be zero. 
These points, called the nodes, are those for which sin kx = 0. We can locate 
these nodes and at the same time find the conditions for standing wave 
formation by requiring that the value of the wave, y, be zero at the clamped 
end of the string. If the length of the string is /, then y(x = /) = 0. Substituting 
this in Eq. 12.12, yields 


0 = 2A sin kl cos wt (12.13) 
Because Eq. 12.13 must be satisfied for all times t, we conclude that 
sin kl = 0 
or 
kl = a, 2m, 37, °° 117 (12.14) 
where n is an integer. Note that kl = —a, —2a, —37,... will also yield a 


zero value for sin kl. However, these negative values correspond to negative 
values of k and hence of the wavelength A and therefore are not physically 
acceptable. Substituting Eq. 11.12 for k in Eq. 12.14, we obtain 


27 
—_ = "nT 
Xv 
or 
2 
A= a (12.15) 


This result tells us that the wavelength of the standing wave cannot be ar- 
bitrary as was the case with the traveling wave. It can only have the values 
21, 2/2 1, 2/31, 2/41, . . . . Aschematic of a photograph of the first few standing 
wave patterns is shown in Fig. 12-19. 


PROBLEMS 


FIGURE 12-19 

Configuration of the first four 
standing waves in a string of 
length 1. 


12.1 Two sources emit waves of the same fre- 


12.3 In the double slit of problem 12-2, what is the 


quency, wavelength, and amplitude. What is the 
amplitude of the resulting wave at a point P ata 
distance x, from source S, and a distance x2 from 
source S, if x; — X2 is (a) one wavelength? (b) one- 
half wavelength? 


12.2 Two slits in an opaque screen are separated 
by a distance d = 10°° m. Light of frequency 
vy = 5 X 10 Hz is shone through the slits. Find the 
angular position of the first three interference max- 
ima. 


angular separation between the first interference 
maxima for two waves of wavelength A; = 6000 A 
and Az = 4000 A? 


12.4 Two slits separated by a distance d = 4 xX 
10-° m, are 1.5 m away from a screen (see Fig. 
12-20). What is the separation y2 — y, between the 
first and the second interference maxima if light of 
wavelength A = 5000 A is sent through the slits? 
(Answer: 1.88 cm.) 


FIGURE 12-20 
Problem 12.4. 


12.5 Light from a sodium lamp contains waves with 
wavelength A; = 5880 A and waves with Az = 5890 A. 
Find the angular separation and the linear separa- 
tion of the two wavelengths on a screen 50 cm away 
from a double slit for the first-order maxima. Do the 
calculations for a slit separation of 70,000 A and for 
a slit separation of 7000 A. 


12.6 In a double slit experiment performed with 
light of wavelength A = 5400 A, the separation be- 
tween the tenth interference maximum and the cen- 
tral maximum on a screen 150 cm away is 10 cm. 
What is the spacing between the slits? 


12.7. Two speakers separated by a distance of 3 m 

emit sound waves of frequency v = 550 Hz. The 

velocity of sound is 330 m/sec. Find the position of 

the points along the line 5;O in Fig. 12-21, at which 
the intensity of the sound will be a maximum. 

(Answer: 7.20 m, 3.15 m, 1.60 m, 0.68 m, 

0 m from $;.) 


—e 


FIGURE 12-21 
Problem 12.7. 


12.8 A source of waves 5 and a detector D are lo- 
cated 8 m apart (see Fig. 12-22). A horizontal reflect- 
ing surface is placed 3 m above the source and the 
detector. The direct wave from S to D is found to 
add constructively with the reflected wave. When 
the reflecting surface is raised an additional distance 
of 0.204 m, the direct and the reflected waves add 
destructively at D. What are the possible values of 
the wavelength A? 

(Answer: 0.500 m, 0.167 m, 0.100 m, 0.071 m,. . .) 
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FIGURE 12-22 
Problem 12.8. - : . 


12.9 Monochromatic (single wavelength) light is 
directed on a double slit. A light meter is placed to 
the right of the slits in the position shown in Fig. 
12-23. When slit S, is closed, the light intensity at 
the location of the meter is I,. When slit S, is closed 
the light intensity is I,. (a) What is the light intensity 
I; when both slits are open if x; — x2 = A? (b) What 
is Ip if x; — x2 = 2 A? |, and I, are not necessarily 
equal. Assume that the size of the slits is smaller 
than the wavelength so that the slits can be consid- 
ered to be point sources. 
(Answer: (a) 1, + IL, + 2 (I,L)'”, 
(b) I, + I, — 2 (I,L)”.) 


FIGURE 12-23 
Problem 12.9. 
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12.10 Light of wavelength A = 5000 A is incident 
ona single slit of width 107° m. What is the angular 
separation between the central maximum and the 
eighth-order diffraction minimum? 


12.11 Light of wavelength 6000 A is sent through 
a single slit. If the angular separation between ad- 
jacent diffraction minima is 0.2°, what is the width 
of the slit? (For small angles, sin 6 = 6 (in radians)). 

(Answer: 1.72 x 10°* m.) 


12.12 Ascreen is placed 2 m to the right of a single 
slit of unknown width. Light of wavelength 5200 A 
is incident on the slit from the left. The separation 
on the screen between the second-order minima on 
either side of the central maximum is 5.2 cm. What 
is the width of the slit? 


12.13. Monochromatic X rays of wavelength A = 
1.2 A are incident on a crystal. The first-order dif- 
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fraction maximum is observed when the angle 6 be- 
tween the incident beam and the atomic planes is 
12°. (a) What is the separation of the atomic planes 
responsible for the diffraction? (b) What is the high- 
est order Bragg diffraction produced by those planes 
that can be observed? 

(Answer: (a) 2.89 A, (b) 4th.) 


12.14 Sodium chloride (NaCl) has a crystal struc- 
ture similar to that of silver bromide (AgBr) shown 
in Fig. 12-18. The atomic weight of NaCl is 58.44 
g/mole and its density is 2.16 g/cm’. (a) Calculate 
the spacing between the atoms in a NaCl crystal. 
(b) If X rays of wavelength 1.5 A are incident on a 
NaCl crystal, at what angle @ will the first order 
diffraction maximum be observed? 

(Answer: (a) 2.82 A, (b) 15.4°.) 


12.15 Potassium chloride (KCl) has the crystal 
structure of AgBr in Fig. 12-18. The molecular weight 
and the density of KCl are 74.55 g/mole and 1.98 
g/cm?, respectively. The distance between adjacent 
atomic planes is 3.14 A. (a) Calculate Avogadro’s 
number from this data. (b) If the first-order diffrac- 
tion maximum for X rays incident on these atomic 
planes is observed when the angle 0 between the 


incident direction and the crystal planes is 30°, what 
is the wavelength of the X rays? 


12.16 The wave velocity in a string 1 m long is 
6 m/sec. What are the frequencies of the standing 
waves in the string? 


12.17 A standing wave in a string is described by 
the equation 


y = (0.7 m) sin (47x) cos (207) 


where x is in meters and t¢ is in seconds. (a) What 
is the amplitude and the velocity of propagation of 
the traveling waves that gave rise to such a standing 
wave? (b) What is the amplitude of vibration of the 
particles in the string located at x = 0.45 m? (c) What 
is the transverse velocity of the particles at x = 
0.45 m at t = 0.25 sec? (d) What are the locations 
of the nodes? | 
(Answer: (a) 0.35 m, 5 m/sec, (b) 0.41 m, (c) 0, 
(d) 0 m, 0.25 m, 0.50 m, 0.75 m.) 


12.18 A standing wave of frequency v = 10 Hz is 
set up in a string of mass m = 0.100 kg and length 
1 = 2m. The maximum amplitude of vibration is 
5cm. What is the total energy of the standing wave? 
Assume that A = I. (Answer: 0.25 J.) 
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13.1 INTRODUCTION 


In this chapter we begin the study of electricity. Although we ultimately wish 
to understand the flow of electric charges through electrical circuits, we must 
start with the simple empirical laws of the interaction of charges at rest, called 
electrostatics. Our starting point will be the observed behavior of electric charges 
at rest and how careful observations by Coulomb led him to postulate laws 
of the behavior of charges at rest in their interaction with one another. We 
will also examine the superposition principle according to which the behavior 
of multiple charges on one another is a simple sum of the one-to-one interactions 
(pairwise). 


13.2 ATTRACTION AND REPULSION OF CHARGES 


Everyone has experienced some of the phenomena of static electricity. When 
you comb dry, clean hair, it is attracted to the comb. For a short time afterward 
the comb will attract small particles such as little pieces of paper. These 
phenomena have been known for thousands of years. The ancient Greeks 
noticed that if a piece of amber (fossilized tree sap) were rubbed with a piece 
of cat fur, it would attract pieces of dry leaves. In fact, the Greek word for 
amber is elektron. 

These electrical phenomena fascinated many early investigators, and a 
useful device called the electroscope was invented about 200 years ago to further 
the studies of these phenomena. An electroscope is simply two very thin gold 
leaves attached together at the end of a metal rod, usually with a metal knob 
at the other end. The leaves are enclosed in a protective glass-windowed case 
(see Fig. 13-1). Gold was used because it is a soft metal that can be beaten 
very thin. Therefore, the foils have very little mass and not much force is 
required to push them apart. 

If an amber rod (hard rubber or a synthetic polymer will also serve) is 
rubbed with cat fur and brought close to the metal knob, it is seen that the 
gold leaves separate into a wide angle as if they are trying to get away from 
each other. Indeed they are, and their behavior is termed “‘repelling’’ one 
another. If the rod is pulled back from the knob, the leaves collapse again 
into the downward position. If, however, the rod is touched to the knob 
before it is removed, the leaves remain apart, in the repelling position. This 
suggests that something has flowed from the rod to the leaves, and we now 
know that it is an electric charge. If a glass rod is rubbed with a piece of silk, 
the same phenomena will occur. However, if the electroscope is first charged 
with the amber rod and then touched very briefly with the glass rod, it will 
discharge; that is, the leaves will fall back down. If the glass rod is in contact 
for a longer period, it is seen that the electroscope will first discharge and 
then recharge; that is, the leaves will again move apart. This experiment can 


FIGURE 13-1 
Electroscope. 


be done in the reverse. That is, the electroscope charged by the glass rod can 
be discharged and recharged with the amber rod. These phenomena were 
explained by the assumption that there are two different types of charges 
arbitrarily called positive and negative: one type was produced on rubbing 
amber with cat fur and the other by rubbing glass with silk. This is still the 
model today, and we speak of positive and negative charges. In other words, 
we now Say that the electron has a negative charge, but this is a result of 
history: its charge could just as easily have been called positive without af- 
fecting its behavior. It was also noted in these studies that the rod in the 
electroscope had to be made of metal. No effect would be observed if it were 
made of wood, rubber, or some other nonmetallic material. This implies that 
electrical charges can flow through metal but not through nonmetals, such 
as the ones just named. Thus metals have been known as electrical conductors 
and nonmetals as insulators. In Chapters 24 and 25 we will analyze the dif- 
ference between these two types of materials and introduce the semiconductor, 
a material with electrical properties that lie between those of a conductor and 
an insulator and whose characteristics form the basis of the modern computer. 


13.3 COULOMB'S LAW 


Charles Coulomb (1736-1806) published between 1786 and 1789 the results 
of a series of experiments that he had performed. Instead of using the elec- 
troscope in which the force between the gold leaves was difficult to measure, 
he used very small lightweight balls on the ends of long threads. The balls 
were made of the centers of dried reed stems called pith and were small so 
that they could approximate point charges. If two were suspended adjacent 
to each other and both touched with either the amber or glass rod, they would 
repel each other as did the leaves of the electroscope. He therefore confirmed 
that like charges repel. If he had touched both with the amber rod and achieved 
repulsion and then touched one with the glass rod they would come together. 
This experiment could be reversed in that if the repulsion were first achieved 
by touching both with the glass rod, touching one with the amber rod would 
cause them to come together. He therefore confirmed that unlike charges attract. 

Before continuing with the work of Coulomb, we can use these two 
findings to interpret the electroscope observations. Accepting that there are 
two types of charges and that at least one of these can move in a metallic 
conductor but not in an insulator, we may view the sequence of diagrams in 
Fig. 13-2. In Fig. 13-2a, the negatively charged amber rod is brought near the 
electroscope and the mobile negative charges in the metal rod, being repelled, 
go to the ends of the gold leaves, their maximum distance from the amber 
rod. The gold leaves now both have an excess of negative charge and repel 
each other. Although at Coulomb’s time it was believed that the positive 
charges were mobile, we now know that this is not generally true and it is 
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Charles Coulomb (1736-1806). 
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FIGURE 13-2 


the negatively charged electrons that move in the metal. The same effect 
occurs in (b) with a positively charged glass rod, for it does not matter if the 
positive charges in the leaves are there because they have been repelled by 
the glass rod or because the electrons have been attracted to it, thereby leaving 
a net excess of positive charge on the leaves. In (c), the amber rod is touched 
to the electroscope. Because the upper end of the metal rod, as shown in (a), 
is positively charged, electrons will flow from the amber to the metal rod. 
When the amber rod is removed in (d), the electroscope has a net excess of 
negative charges throughout and the leaves therefore repel each other. In (e), 
a positively charged glass rod is touched to the negatively charged electro- 
scope of (d) and first the excess electrons leave, causing the leaves to collapse 
because they now have no excess charge, that is, they are neutral. If this 
positively charged glass rod has a sufficient amount of charge and it is rubbed 
further against the metal rod of the electroscope, more electrons will leave 
the electroscope, resulting in an excess of positive charges in the electroscope. 
Therefore the leaves repel each other as in (f). 

We now return to Coulomb’s experiments concerning the forces between 
charges. Let us represent a quantity of charge, a scalar, by the letter q and 
the distance between charges by the letter r. Coulomb, in his experiments, 
attached fine threads to the pith balls and passed them over glass rods, which 
effectively served as frictionless pulleys. To the ends of these threads he 
fastened various weights (see Fig. 13-3a). If the charges g,; and q2 placed on 
the pith balls from the amber and glass rods were of opposite sign, the balls 
would be attracted to each other by a force F, which would be countered by 
an appropriate weight Mg. The force diagram for q2 is shown in Fig. 13-3b. 
Note that the tension T> in the horizontal thread is equal to the hanging 
weight Mg. We have here the equilibrium problem of Chapter 4 where the 
forces to the left are T,; cos @ plus the electrostatic attraction, and these are 
equal to Mg (T2 = Mg). By measuring the weights Mg required to hold the 
balls apart a distance r, Coulomb found that the magnitude of the attractive 
force was proportional to the reciprocal of the square of the distance between 
the balls; that is, 

1 


Fa— 
ye 


(b) 
FIGURE 13-3 

(a) Experimental arrangement for 
the determination of Coulomb's 
law. (b) Force diagram for q. 


He then put the same type of charge on each ball and draped the threads 
over the rods in the opposite direction to measure the force required to hold 
the balls at various distances r from each other. He found that the force of 
repulsion also varied inversely as the square of the distance between them. 
Having established this relation, he was then able to hold them at a fixed 
distance and vary the charges on them. He could do this accurately by frac- 
tionating the charges. He had a set of metal knobs on wooden sticks (insu- 
lators). If he touched one of these metal knobs to an amber or glass rod it 
would acquire a charge of magnitude q. If he then touched this knob to an 
identical uncharged one they would share the charge equally and each would 
have a charge q/2. Touching either of these to another would reduce the charge 
to g/4, and so on. Touching two knobs in contact with one another would 
produce a charge of g/3. Many combinations of these fractions were used to 
charge the pith balls at a fixed distance from one another. Coulomb showed 
that the force of attraction between oppositely charged balls or of repulsion 
between balls with charges of the same sign was proportional to the product 
of the magnitude of the two charges, g1 q2. Coulomb’s law is therefore 


94142 
F x aa (13.1) 
where the sign of q; and gz may be either plus or minus. This proportionality 
can be made into an equality by introducing a constant dependent on the 


system of units used. In the SI system this constant, taken as 1/47re,, has the 
value 


= 9 X 10? N-m?7/C? 


47TE_ 


The symbol C stands for Coulomb and is the unit of charge. It is important 
to note at this time that 1 C is not the charge of an electron. The charge of 
the electron in coulombs is e = —1.6 x 10°C. 

Equation 13.1 can now be written as 


pa bh (13.2) 


ATE e 


The direction of the force that q; exerts on q2 is along the line joining the two 
charges, pointing away from gq, if the force is repulsive (q, and q2 having the 
same sign) or toward q; if the force is attractive (gq; and q2 oppositely charged). 


Example 13-1 


Two pith balls of mass 0.1 g each are suspended on 50-cm threads. They are 
given equal charges and assume a position in which each makes an angle of 
20° with the vertical, as in Fig. 13-4a. What is the charge on each? 


Solution The vector diagram of the forces on the right-hand ball is shown 
in Fig. 13-4b, where F is the coulombic force of repulsion between the two 
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charged pith balls. Because the ball is in equilibrium, we may write 


LF, = 0 

F — T cos 70° = 0 
F = 0.34 T 
LF, = 0 


mg 
sin 70° 


_ 0.1 x 10>? kg x 9.8 m/sec 
7 0.94 


T = 1.04 x 10°°N 


Substituting this value of T in the equation for F, we obtain 
F = 0.34 T = 0.34 x 1.04 x 10°°N = 3.5 x 10°4N 


From Fig. 13-4a, the distance r between the two balls is 


r = 21 sin 20° 
4 (b) 
r= 2 xX 0.5m X sin 20 FIGURE 13-4 
r = 0.34 m Example 13-1. 
Using Coulomb’s law, 
N-m? q? 
F =9 x 10? @ 2 
because 
qi = 2 
and substituting for F and r we obtain 
| 9 x 10° _ qr? 
3.5 x 10°*N = —.—_>— 
(0.34m)? 


Or 


q= 6.7 x 10°°C 


13.4 CHARGE OF AN ELECTRON 


In a series of experiments, Robert Millikan (1868-1953) in the years 19099 = =\ 
through 1913 measured the charge on an electron (see Fig. 13-5). With a spray * ee ay 

he introduced fine oil drops between two parallel metal plates and observed 
the motion of a single drop through a telescope. By its rate of fall through Robert Millikan (1868-1953). 


the air he was able to use a formula for the terminal velocity (constant rate 
of fall through a medium) to estimate its weight. He found that he could 
arrest its downward motion, that is, hold it stationary by placing a positive 
charge on the upper plate. (We will see later that he could control the positive 
charge on the plate by means of the voltage.) Therefore, the balance of forces 
of mg down and the upward attraction could be used to determine the charge 
on the drop. First, he found that the drops always acquired a negative charge, 
never positive. This showed that it is the negative charge that is apparently 
the more mobile one. His second finding, over hundreds of experiments, was 
that the smallest charge that was ever acquired by the drop had a magnitude 
of 1.6 x 10~'° C and that larger charges were always integral multiples of 
this quantity. He therefore assigned this value to the charge of the electron; 
it is the smallest negative charge that can be found. Because atoms are neutral 
and contain equal numbers of electrons and protons, the charge of the heavy 
and essentially immobile proton also has this magnitude, but it is positive. 
We will see that the protons are in the nucleus and are relatively massive. It 
is therefore understandable why it is the negative charge that is the mobile 
species rather than the positive one. This was not known prior to Millikan’s 
experiment. The great theories of electrical behavior were developed in the 
nineteenth century when it was assumed that the positive charge was the 
mobile species. We will see that all electrical definitions are based on the 
behavior of a unit positive charge. This sometimes leads to the confusion of 
students; but, as mentioned earlier, the assignment of the words positive and 
negative are completely arbitrary, and therefore the theories remain valid. 


13.5 SUPERPOSITION PRINCIPLE 


Coulomb’s law, defined in Section 13.3, relates to the force between two 
charges. It is an empirical law derived from experimental measurement. How 
does one treat a situation in which three or more charges are involved? To 
answer that requires further experimentation. It is found that the force be- 
tween any two charges in a group of charges is independent of the presence 
of the other charges. What this means is that if one selects a given charge in 
a group and asks for the total force on it, this force would be the resultant 
of the individual vector forces on it from each of the charges. This is called 
the superposition principle of charges. It makes the calculations straightforward 
because we can treat them as we did in summing vector forces in Chapter 2. 
Let us examine the case of three charges in Example 13-2. 


Example 13-2 


Three charges are arranged in a triangle as shown in Fig. 13-6a. What 
is the direction and the magnitude of the resultant force on the 1 x 10°° C 
charge? 


SUPERIMPOSITION PRINCIPLE 


183 


FIGURE 13-5 
Experimental arrangement tor the 


determination of the charge of the 
electron. 


184 ELECTROSTATICS 


FIGURE 13-6 
Example 13-2. 


Solution: The force resulting from the 4 x 1078 C charge by Coulomb’s 
law is 
N-m? 1 x 10°°C x 4 x 10°78 C 7 
Fi =9 x 10°-a aC x 10°7N 


at 30° above the positive x axis (see Fig. 13-6b). 
The force resulting from the 2 x 107° C charge is 


N-m? 1 x 10°C x 2 x 10°78 C 


a eee = 1.8 x 10-4N 


F, = 9 x 10? 


at 30° below the positive x axis (Fig. 13-6b). 
We now use the vector diagram of these two forces, Fig. 13-6b, and find 
the resultant by the component method of Chapter 2. We have 


F, = 3.6 x 10°*N cos 30°i + 3.6 X 10°4N sin 30° j 

F, = 1.8 x 10°*N cos 30°i — 1.8 x 10°74 N sin 30° j 

R = 4.7 x 10-4 Ni + 0.9 x 10-4Nj 

IR| = V(4.7 x 10-4 N)? + (0.9 x 1074 N)? = 4.8 x 1074N 


= arctan = 10.8° above the positive x axis 


The direction of the resultant force vector is indicated by the arrow labeled 
R in Fig. 13-62. 


PROBLEMS 

13.1 Two particles of charge q, = +2 xX 10°? C 13.2 A particle of charge gg; = —2 X 10~° C is 
and q2 = +3 x 10~? Care placed 0.04 m from each placed midway between the two charged particles 
other. What is the force of repulsion that each ex- of problem 13.1. What is the net force on it and in 


periences? what direction? 


13.3. At what position between particles 1 and 2 of 
problem 13.1 will particle 3 of problem 13.2 experi- 
ence no net force? 

(Answer: 0.018 m from qi, between gq; and q>.) 


13.4 Three charges lie on the x axis as in Fig. 13-7. 
Find the resultant force on the middle charge, qp. 


Y 


FIGURE 13-7 = +2 % 107°C _Y ‘a q- 4x 107°C 
Problem 13.4. dap Z 


13.5 An iron atom of mass 9.32 x 10~*° kg has 26 
electrons. The density of iron is 7.86 g/cm?®. If two 
identical iron balls each of volume 1 cm? were stripped 
of all their electrons and placed 1 m apart, (a) What 
would be the electrostatic force between them? (b) 
What would be the gravitational force between them? 
(c) Compare these forces. 


13.6 Two particles of mass 5 kg each are given an 
equal amount of charge. (a) What must the charge 
be on each particle so that the gravitational attraction 
exactly balances the electrostatic repulsion? (b) How 
many electronic charges does that charge corre- 
spond to? 


13.7. Two threads of length 0.7 m support balls of 
mass 0.2 gm as in Example 13.1. Equal charges of 
the same sign are put on the balls and they repel, 
each making an angle of 30° with the vertical. What 
is the charge on each ball? 

(Answer: 2.48 X 1077 C.) 


13.8 A charge q is to be shared by two particles. 
What must be the charge on each particle so that 
the force between them, for a fixed separation, is a 
maximum? 


4d 


A 
(Answer ° 


13.9 An electric dipole consists of two charges of 
equal magnitude g but of opposite sign separated by 
some distance d. The electric dipole moment is de- 
fined as uw. = gd. Consider an electric dipole lying 
on the y axis as in Fig. 13-8. What is the force exerted 
by the dipole on a charge q’ located on the x axis at 
a distance x from the origin? 
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| 
FIGURE 13-8 q 
Problem 13.9. | 


13.10 Four charges are located at the corners of a 
square as shown in Fig. 13-9. (a) What is the re- 
sultant force on q2? (b) What should q, and gq, be so 
that the resultant force on q2 is zero? 


g = +1x1078c g '2 x 107" € 


FIGURE 13-9 
Problem 13.10. 


13.11 Two charges, q, = 3 x 10°° C and gq = 
-3 x 107° C, are connected by an insulating rod 
10 m long. The rod is pivoted about its midpoint. 
The rod is kept horizontal 10 cm above the floor. 
Two identical charges, g = 5 X 107° C, are fixed 
directly below g, and qg2, as shown in Fig. 13-10. 
Where should a 3-kg weight be placed on the rod 
to keep the rod horizontal? 


FIGURE 13-10 
Problem 13.11. 


13.12 A fixed conducting ball has a charge q; = 
3 xX 10°°C. An identical ball with charge q2 is held 
at a distance x away from q,. The two balls attract 
each other with a force of 13.5 N. The balls are then 
connected by a conducting wire. After the wire is 
removed, the balls repel each other with a force of 


186 ELECTROSTATICS 


0.9 N. (a) What was the charge q, of the second ball? 
(b) What is the separation x between the balls? 
(Answer: (a) —5 X 107°C, (b) 0.10 m.) 


13.13, Ana particle (g = 3.2 x 107” C) is projected 
from far away directly toward a gold nucleus (q’ = 
79 x 1.6 x 10°”C). The mass of the a particle is 6.7 
x 10°” kg and its initial velocity is 4 x 10° m/sec. 
Use the work-energy theorem of Section 5.4 (Eq. 5.9) 
to calculate the closest distance of approach of the a 
particle to the nucleus. Assume that the gold nucleus 
remains stationary. (Hint: In this case the force on the 
a particle is not constant. Therefore, the integral form 
for work, Eq. 5.7’, must be used to evaluate the work 
done on the a particle.) 

(Answer: 6.8 x 107” m.) 


13.14 Two positive charges g are held fixed and are 
separated by a distance 2a. A third positive charge 
q' of mass m is initially placed halfway between them 
(Fig. 13-11). q’ is then displaced a small distance x 
(x << a) and released. (a) Show that the force on q’ 
is approximately proportional to x and in the op- 
posite direction of the displacement x. (b) The force 
on g’ is therefore similar to the force exerted by a 
spring on a block connected to it (Chapter 10). What 
is the period of oscillation of q’? 

TE, onl ' 


(Answer: 27 : 
q 4 


i q q 
FIGURE 13-11 a——___@_e___e 
Problem 13.14. | 
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the Electric Potential | 


188 THE ELECTRIC FIELD AND THE ELECTRIC POTENTIAL 
14.1 INTRODUCTION 


We have seen in the preceding chapter how the presence of an electric charge 
has an effect on another electric charge. This raises the question: What if there 
is only one electric charge present? The idea of an electric field is introduced 
to describe the effect in all space around a charge so that if another charge 
is present we can predict the effect on it. If we have multiple charges, such 
as in Example 13-2, we see that the force of each on a third charge is a vector, 
and the net effect on the third charge is the resultant of the forces. This 
resultant will differ with both the position and the charge of the third one. 
The concept of separating the calculation into the formation of an electric field 
and the response to the electric field by a given charge placed in it greatly 
simplifies the calculations. 


14.2 THE ELECTRIC FIELD 


If, in Example 13-2, we had wished to find the resultant force on a charge of 
a different magnitude in the same position, we would have to repeat the 
calculations. However, after a few such repeat calculations we would notice 
that all one has to do is multiply the first result by the ratio of the magnitude 
of the new charge to that of the charge used in the first calculation. If the 
first calculation has been made for a test charge of +1 C, the task will be 
easier, for then the ratio of the magnitude of the new charge to that of the 
charge of the first calculation is simply the magnitude of the new charge. In 
other words, let us simply define a new quantity, called the electric field, with 
symbol €, at a point in space as the vector resultant force experienced by a 
positive test charge of magnitude 1 C placed at that point. If an arbitrary test 
charge q’ is placed at that point, the charge will experience a force 


= q'é (14.1) 


Thus, the electric field at a point in space can be calculated by measuring the 
force experienced by a test charge q’ and dividing it by the magnitude of the 
test charge; that is, 


F 
€=— (14.2) 
q 


If we consider two charges, g and gq’, separated by a distance r, from 
Coulomb’s law (Eq. 13.2) the magnitude of the force between them is 


log 
F=-—-—-> 13.2 
Amey 1 ( ) 


Let us arbitrarily consider q' as the test charge and gq as the charge creating 
the electric field at the point P where q’ is located. On substitution of Eq. 13.2 


in Eq. 14.2, the magnitude of the electric field produced by q at P is given by 


1 i 
€ = At 
Ameo q'r 
or 
1 4 
6 = = 
Amey 17 eo) 


Equation 14.3 is the general expression for the electric field resulting from a 
charge q at a point located a distance r away from the charge. 

The direction of 6 can be deduced from its original definition as the force 
experienced by a unit positive charge. If the charge g producing the field is 
positive, then a positive test charge of 1 C, when placed at P, will be repelled. 
We conclude that the electric field produced by a positive charge q at a point 
P is along the line joining the charge q and the point P and directed away 
from g. It may be said that the electric field is directed radially away from a 
point charge q (see Fig. 14-1a). On the other hand, a negative point charge g 
will attract the 1-C positive test charge and consequently the electric field that 
it produces is directed radially toward it (see Fig. 14-10). 

We have indicated (Section 13.5) that Coulomb’s law obeys the super- 
position principle. From the definition it follows that the electric field does 
too. The field produced by a group of charges is simply the vector sum of 
the fields produced by the individual charges. 

An important point to mention here is that not only is there no electric 
field when there are no charges, but there is no electric field at a point when 
the forces from an assembly of charges on a test charge is zero at that point. 
Also, if an array of equal numbers of positive and negative charges are located 
in a small region, then at some distant point (distant relative to the distance 
between the charges) there is no measurable electric field. This is why atoms 
when they are far away from each other, such as in a dilute gas, experience 
no measurable electric fields. However, the electrons that make up the atom 
experience the electric fields of the nucleus and of the other electrons. Fur- 
thermore, if a charged particle is shot at an atom, as in a nuclear experiment, 
it will get close enough to experience the internal electric fields of the atom. 


Example 14-1 


A charge q, = 3 X 107° C is located at the origin of the x axis. A second 
charge gg = —5 Xx 10~° C is also on the x axis 4 m from the origin in the 
positive x direction.- (a) Calculate the electric field at the midpoint P of the 
line joining the two charges. (b) At what point P’ on that line is the resultant 
field zero? 


Solution 


(a) Because q; is positive, its electric field €, at P is away from it, that 
is, in the positive x direction. The electric field €, produced at point 
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(b) 
FIGURE 14-1 
Direction of the electric field set 
up at point P: (a) by a positive 
charge g, and (b) by a negative 
charge —q. 
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€. p 82 Ese 
J # a eT, $f? O_, x 
—__.— a, ) % FIGURE 14-2 
i x <—em——2m— Example 14-1. 


P by q2 is toward q2, that is, in the same direction as €, (see Fig. 
14-2). From Eq. 14.3, the magnitudes of €; and ©, are 


N-m’? 3 x 107°C 


18) =9 x 10° = 6.75 x 10° NIC 


Cc (2m) 
2 —6 
[S| = 9 x 10° <3 ae = 11.25 x 10° NIC 


Because it is seen in Fig. 14-2 that both @, and ©, are directed along 
the positive x direction, the resultant electric field € at P will be 


& = §& + & = 6.75 x 10° N/C + 11.25 x 10° N/C 
18 x 10° N/C 


From part (a), it is clear that the resultant € cannot be zero at any 
point between q, and gq. because both €, and €, are in the same 
direction. Similarly € cannot be zero to the right of qz because the 
magnitude of q2 is greater than q, and the distance r in Eq. 14.3 is 
smaller for qz than qg,. € can only be zero to the left of g; at some 
point P’ to be found. 


(b 


ww 


€ = €, + € = 0 
6, = -€ 


and 


Or 


Amey x2 Aateg (x + 4) 
3(x + 4)? = 5x? 
2x* — 24x — 48 = 0 
x = 13.75m, x = —-1.75m 


The second root of the quadratic equation, x = — 1.75 m, represents 
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a point between the charges at which @, = € 2. However, as indicated 
earlier, between the charges, 6, and @2 have the same direction and 
consequently the resultant field is not zero. 


14.3 ELECTRICAL POTENTIAL ENERGY ~—$<_<—<—<_- 0); 


We now wish to develop an expression for the amount of work required to 
move a charge in an electric field. First we note that the magnitude of the 
electric field at a point P resulting from a point charge is independent of the 
angular position of the point P, because only distance enters into Eq. 14.3. 
In the preceding section we also showed that the direction of the electric field  eyaupe y4-2 
is radially away from the charge producing the field if the charge is positive Two possible paths for bringing 
or radially toward it if the charge is negative. Thus, the direction of the electric a charge q’ from point A to point 
field from a positive point charge may be represented by simply drawing  B. 
arrows out from it in all directions. Figure 14-3 shows such a schematic in 
two dimensions only. If we move a positive test charge q’ from point A to 
point B, we have a situation similar to that of Chapter 5 where we showed 
that the work against a gravitational force is independent of the path. We 
must recognize that in this case the force is not constant. In Fig. 14-3 work 
must be done whenever the radial distance between the moving charge, q’, 
and that which creates the electric field, q, is changed. However, when q’ 
moves tangentially, no work is done because the direction of motion is per- 
pendicular to the electric field and therefore to the force acting on q'. Recall 
that by definition work involves the dot product of the force vector F and 
displacement vector As, that is, W = F- As (Eq. 5.3). In Fig. 14-3 the same 
amount of work is done in moving a charge from point A to point B either 
by path 1 (solid line) or by path 2 (dashed line) or by any other path. 
Because the force on the test charge q’ is not constant but changes with 
distance, we use Eq. 5.7’ to evaluate the work done in moving it from point 
A to point B 


B 
Waoe = f F- ds (5.7') 


Just as in the case of gravity, considered in Chapter 5, the force F needed 
to move q’ at constant velocity must be equal and opposite to the force exerted 
by the electric field of q, that is, because the force of the electric field on q’' is 


qg' 6, a force F = —gq'é is needed to move q’ with constant velocity, where @ a : — - 
is the electric field produced by q. Substituting this for F in Eq. 5.7’ we have q@—— as 
B 
Wass = —q’ [ é:ds (14.4) 
Because W,_.3 is independent of the path followed, we will evaluate it by \ A. 
moving tangentially from point A to C and then radially from point C to point 2 


be 


B (see Fig. 14-4). During the first leg of this trip (A to C), the work done is A 
zero because @ is perpendicular to the displacement ds. However, the work FIGURE 14-4 
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done from C to B is @- ds = € ds cos 180° = —€ ds, and Eq. 14.4 becomes 
B 
Wass = 4 [. ds (14.5) 


As we move a distance ds toward B from point C, the radius r decreases, 
which introduces a negative sign, that is, ds = —dr. Using this in Eq. 14.5, 
we obtain 


B 
Wasp = —G' J. dy (14.6) 


The electric field for a point charge q is given by Eq. 14.3. Substitution of Eq. 
14.3 for € in Eq. 14.6 yields 


' (7%. dr 
Waos = — ne = 


ATE TA r 


Note that we have put ra instead of rc as the lower limit of the integral 
because ra = rc and we are evaluating the work done in moving q' from A 
to B. Integrating obtains 


7/11 
Wie = (- 2 = (14.7) 


By definition (see Section 5.3), the work done in moving an object between 
two points in a force field is equal to the difference in the potential energy 
E, between the two points; that is, 


' fi 1 
E(B) — E,(A) = (-. - " (14.8) 


TTEg \'p TA 


Equation 14.8 gives the difference in the potential energy of the two charges 
when q’ is located at two different points. It does not give the potential energy 
of the charges when q’ is at B or at A. For this, as indicated in Section 5.3, 
we must specify a reference point, that is, a point at which the potential 
energy is arbitrarily chosen to be zero. In electrostatics, this point is often 
chosen to be r = ©, that is, when the two charges are separated by an infinite 
distance. With this assumption, the potential energy of our two charge system 
g and q' when they are separated by a distance r is simply the work done in 
bringing one of them (for example q’) from infinity to r. Setting rg = © and 
fg = rin Eq. 14.8, we have 


E,(r) = as (14.9) 


4TTrE r 


This potential energy is called electric potential energy to differentiate it from 
the gravitational or the elastic potential energies that we encountered earlier. 
We should note that, because both q and q’ are positive, E, is also positive. 
To move q’ from infinity to r we have to do positive work, we have to overcome 
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the repulsive force between the two charges, that is, the external force must 
act in the direction of the displacement. 

The same is true if both g and q’ are negative. Equation 14.9 holds also 
in this case because the product of two negative charges will yield a positive 
value for E,. If the charges are of unlike sign, they will attract each other and, 
consequently, to move q’ at constant velocity, we will have to hold it back. 
We will then do negative work and therefore, the potential energy will be 
negative. It is seen that Eq. 14.9 agrees with this, for if q is positive and q’ is 
negative, or vice versa, E, will be negative. 

Let us now have two fixed charges q; and q2 at a distance 12 from each 
other. To achieve this, an amount of work equal to 


1 hh 
ATE 112 


had to be done; that is, the potential energy of the charges is 


1 4192 
= 14.10 

ATE Y12 ( ) 
Consider now a third charge q3 that is brought from infinity to point P as 
shown in Fig. 14-5. How much work must be done? Or equivalently, what 
is the change AE, in the potential energy of the charges? From Eq. 14.4, 
setting A = ~, B = P, and gq’ = qs, we have 


P 
AE, = ~¢s [ é:ds (14.11) 


We have already indicated that the electric field obeys the superposition 
principle. That is, 6 = €; + €>5, where €, and ©» are the electric fields 
produced by q; and qz, respectively. Substituting for € in Eq. 14.11, we obtain 


P 
AE, = -q | (€: + €2)-ds 


oC 


or | 
P i 
AE, = -43 | €,°ds — a | G2: ds (14.12) 


Equation 14.12 shows that the total work done in bringing q3 to point P is 
simply equal to the sum of the work done against the electric field produced 
by each charge in the absence of the other. Thus, by using Eq. 14.9 for q and 
g3 and for gq, and q3 independently, we may write 


A ee (14.13) 


e ATE 113 ATE 13 


We should remember that energy is a scalar quantity and, therefore, the 
sum of the two contributions to the potential energy in Eq. 14.13 is an algebraic 
sum, not a vector sum as in the case of the electric field. The total energy of 
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FIGURE 14-5 
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the three-charge system shown in Fig. 14-5 is obtained by combining Eq. 14.13 i -4 m—+}=-———5 m——+| 
with Eq. 14.10, that is, —@—__@ 
4) V5 q. 
1 (DR , 14 , 4s FIGURE 14-6 
ge Se ae a ee 14.14 
ATE, ( '12 '13 13 ( ) Example 14-2. 


Thus, for a system of charges, the procedure to follow is to calculate the 
potential energy separately for the pairs and then to add these algebraically. 


Example 14-2 


Three charges—q, = 3 X 10°°C, go = —5 x 10°°C, and gq; = —8 x 10°°C 
—are positioned on a straight line as shown in Fig. 14-6. Find the potential 
energy of the charges. 


Solution From Eq. 14.14, we may write 
N-m?2 | (3 x 10°° C)(—5 x 107° C) 
Ey = 9 x 10° “Co Saw ene 
, BX 1076 Of=8 x 10°66) 
9m 


* (-—5 x 10°°C\(-8 x 10°° C) 
5m 


E, = 1.43 x 10-2J 


14.4 ELECTRIC POTENTIAL 


In the preceding section we saw that when a test charge q' is moved from 
point A to point B work is done against the electric field produced by q. The 
amount of work done (Eq. 14.4) depends on the strength of the field and on 
the magnitude of the test charge q’. We can introduce a quantity, called the 
electric potential, with symbol V, which is independent of the test charge. The 
electric potential at a point P is defined as the work done in bringing a unit 
positive charge from infinity to the point. That is, from Eq. 14.4, setting 
A = ©, B = P, andq’ = +1C, we have 


P 
(2.2 Ves = J €+ds (14.15) 


Because the magnitude of the test charge g’ was set equal to unity, the po- 
tential at a point depends on the electric field alone and not on the test charge 
g’. However, if we know the potential at point P, we can immediately con- 
clude, by comparing Eq. 14.4 and Eq. 14.15, that the work done in bringing 
a charge q’ of arbitrary magnitude or sign to P is W = q’V(P). But this work, Alexander Volta (1745-1827). 


by definition, is the potential energy of the charge, and we therefore write 
E, = g'V (14.16) 


The SI unit of potential is known as the volt in honor of the Italian scientist 
Alexandro Volta (1745-1827). From Eq. 14.16 it is seen that one volt can be 
defined as one joule per Coulomb. 

We can use the results of the preceding section to evaluate the potential 
resulting from a point charge q at a distance r away from it. If we equate Eqs. 
14.9 and 14.16, we conclude that 


sae 


ra (14.17) 


In the preceding section, we saw that the work done in moving a charge 
in the resultant field of several charges could be found by summing the work 
done against the electric field independently produced by each charge. We 
therefore conclude that the potential resulting from several point charges is 
simply equal to the algebraic sum (remember that work is a scalar quantity) 
of the potential resulting from each charge. That is, 


—loa, le... 
ATE, 1 A7TE 1> 


(14.18) 


where 11, 72, . . . are the distances from q; and q2, respectively, to the point 
where the potential is being evaluated. 

In electricity, like in mechanics, one is often interested in the difference 
in potential between two points rather than in the absolute value of the 
potential at a point. A potential difference between two points is commonly 
referred to as a voltage difference or simply voltage. This difference can be found 
by applying Eq. 14.18 to the two points in question and finding the difference. 
Alternatively, the potential difference can be calculated directly from the elec- 
tric field. From Eq. 14.4 the work done in moving q’ from A to B is 


B 
ee [ €-ds (14.4) 


and by definition, this is equal to the difference in potential energy 
E,(B) — E,(A). Using the relation between potential and potential energy, Eq. 
14.16, we write 


Wa 6 

~Ast = AV = VB) - VA) = ~ [8d 
We can eliminate the minus sign by inverting the limits of integration, that 
is, 


AV = V(B) — V(A) = [ €-as (14.19) 


Several practical conclusions may be drawn from Eq. 14.19. Consider two 
plates, B which is positively charged and A which is negatively charged (see 


V(B) - 
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V(A) 
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Fig. 14-7). As we saw in Section 14.2, the electric field is directed away from 
the positive charges and toward the negative charges. Thus in Fig. 14-7, @ is 
directed from plate B to plate A. A unit of positive charge placed at B will be 
accelerated toward A. Noting that objects are accelerated when they move 
from a point to another of lower potential energy (recall the case of gravity), 
and remembering that by definition, the potential at a point is the potential 
energy of a unit of positive charge at that point, we conclude that V(B) > 
V(A). That is, the positively charged plate is at a higher potential than the 
negatively charged one. The result illustrated in this example can be gener- 
alized by stating that the electric field is directed from high potential points to low 
potential points, and that positive charges, if free to move, do so from high potential 
points to low potential points. For negative charges the opposite is true: A 
negative charge placed near plate A will be accelerated toward plate B; that 
is, negative charges are accelerated from low potential points to high potential points. 
In fact, not only can we say in what direction the charge will accelerate but 
we can calculate the velocity with which it will reach the other plate, if we 
know the potential difference between the plates. For this we use the con- 
servation of total mechanical energy, which in this case can be written as 


E,(B) + gV(B) = E,(A) + qgV(A) (14.20) 
That is, the sum of the kinetic and potential energies of a charge q at point 
B is equal to the sum of these energies at point A. 
Example 14-3 


A potential difference of 100 V is established between the two plates of Fig. 
14-7, B being the high potential plate. A proton of charge g = 1.6 x 10°” C 
is released from plate B. What will be the velocity of the proton when it 
reaches plate A? The mass of the proton is 1.67 x 10°’ kg. 


Solution Because the proton is released with no initial velocity, E,(B) is 
zero. From Eq. 14.20, we write 
EA) = 4 [V(B) — V(A)] = q AV 


or 
! 2 
5 ma = gq AV 


Solving for va 


2q AV 
a oo 


m 


7 P x 1.6 x 107% C x 100 V 
~ \ 1.67 x 10-27 kg 


1.38 x 10° m/sec 


aa, 
= 


¥(B) VA) 


FIGURE 14-7 


E,(B) + qV(B) 
= E,(A) + qV(A) 


14.5 THE ELECTRON VOLT 


A useful unit of energy is the electron volt (eV). Because electric potential 
difference is the work required per coulomb, then g AV is the energy required 
to move a charge g through a voltage difference AV. The charge of the electron 
isq =e = —1.6 X 10~" C (Section 13.4). If an electron is moved through a 
potential difference of 1 V (1 J/C) the energy change is 


le] AV = 1.6 x 10°? C x 1J/C = 1.6 x 10° J 


We define 1 electron volt (eV) as 1.6 x 107" J. Because the energies of 
electrons in atoms and solids are of the order of 1071 J, the electron volt is 
a convenient unit of energy to use in these cases. 

Suppose an electron is moved away from a positive charge through a 
potential difference of 100 V. The electron’s potential energy has therefore 
increased by 100 eV. By energy conservation, if the electron is now released 
from this point it will acquire a kinetic energy of 100 eV when it arrives back 
at its starting point. 


14.6 ELECTROMOTIVE FORCE 


In the electric circuits that we will develop in the next chapter, the symbol 
4 | will be used and labeled with a voltage magnitude, such as 10 V. This 


symbol represents a battery that is a source of electrical potential energy. A 
battery is a contained chemical reaction. There are two types, the wet, or 
rechargeable type used in an automobile, and the dry type, which is used in 
flashlights. When the chemical reaction in a wet cell is exhausted, it can be 
recharged a number of times by sending current through it in the reverse 
direction. When the chemical reaction in a dry cell is exhausted, the battery 
is “dead.” In both types there are two electrodes (plates or rods) whose exposed 
portions are called terminals, the anode and the cathode. These are suspended 
in an ionic solution in the wet cell and an ionic gel in the dry cell. The solutions 
and gels are called electrolytes; in wet cells the electrolyte is usually an acid. 
The anode is made of a material that strongly attracts the positive charges 
from the electrolyte whereas the cathode is made of a material that has a 
strong affinity for negative charges. As the anode and cathode attract their 
respective ions fromthe solution, they become electrostatically charged to 
the extent that they cannot attract further ions from the solution. In most 
electrolytes this balance of forces on the ions, that is, the attraction of the 
electrode versus the attraction of the solution occurs at around 1.5 V to 
2.0 V. This is the approximate voltage between the terminals of a chemical 
battery. If a wire is connected between the terminals of a battery, charges can 
flow between the terminals. As the charges on the electrodes decrease in 
number, the chemical action inside the battery again takes place and charges 
again migrate from the electrolyte to the plates. In this way the battery main- 
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The car battery is an example of 
a rechargeable wet cell. The stan- 


-dard D, C, or AA batteries used 


in flashlights, portable radios and 
many types of toys belong to the 


dry type. 
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tains a constant potential difference between the plates. This type of potential 
difference is called an electromotive force. This we now know is a misnomer. 
There is no “force” between the plates, only an electric potential difference. 
We simply call it emf. Higher emfs are obtained by combining cells in series. 


The symbol for battery mentioned earlier, 4 L, represents a single cell 


and is generally used for emfs of 1.5 V or less. The large line represents the 
anode as a source of positive charge, and the small line represents the negative 
side or the cathode. As mentioned before, early scientists assumed that the 
charges that flow were the positive ones. To this day we indicate charge flow, 


or current direction as emanating from the anode + . This is called con- 


ventional current, and its use does not affect the results of common circuit 
calculations. When a circuit diagram is used in which there is an emf source 


of several volts the symbol 4 | | | | L- is used, which represents many batteries 


in series. The number of these lines depends on space available or persistence 
of the draftsman, and one should not count the batteries drawn to obtain the 
magnitude of the emf. 


14.7 CAPACITANCE 


Suppose we connect the terminals of a battery to two parallel metal plates, 
as in Fig. 14-8. The plate on the left will quickly attain a negative charge of 
—q and the one on the right a positive charge of +q. The plates are char- 
acterized by having a charge q, the magnitude of the charge on either of them. 
It is evident that if the emf of the battery is small, the charge q on the plate 
will be small and if the emf is large, the charge q will be large. Experiments 
show that the charge is proportional to the potential difference, AV or emf, 


qx Vv 


where V actually means AV or voltage difference between the two terminals 
of the battery. We make this relation into an equality by introducing a constant 
C so that 


gq = CV (14.21) 


The arrangement of such a set of plates as in Fig. 14-8 is called a capacitor, 
and the constant C is called the capacitance. The constant has units of 


C= q (coulomb) 
~ V- (volt) 


which is given the special name of farad (abb. F) where 


1 farad = 1 coulomb/volt 


1” 


# 
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FIGURE 14-8 

Two parallel metal plates sepa- 
rated by an insulator, such as air, 
form a capacitor. When con- 
nected to a source of potential 
difference, the metal plates ac- 
quire equal but opposite charges 
gand —g. 


A farad is a very large quantity, and the usual capacitor in an electronic circuit 
is of the order of microfarads (1 wF = 10~° F) or picofarads (1 pF = 


10~ '* F). The symbol for a capacitor in an electric circuit is 4 L. 


Equation 14.21 represents the charge on a capacitor in a vacuum, and in 
air there is very little change. Suppose that some nonconducting material, 
either liquid or solid, is placed between the plates. It is found experimentally 
that the capacitor will have a higher charge for the same voltage by a factor 
x. The material placed between the plates is called a dielectric and the factor 
K is called the dielectric constant. Therefore, Eq. 14.21 is written as 


g = xCV (14.22) 


where « for air or vacuum is unity (1). Some example of the values of x are 
given in Table 14-1. 


TABLE 14-1 

Material Dielectric Constant 
Vacuum 1 

Paper 3.5 


Rubber 7 


We can also see from Eq. 14.22 that if we wish to maintain a given charge q, 
less voltage is required with the dielectric present. That is, if it requires Vo 
volts to produce a charge q on the capacitor in a vacuum, then when a dielectric 
is introduced the same charge can be produced by a voltage V = Vo/k. 


PROBLEMS 


14.1 (a) What is the electric field at a point 0.12 m 
from a point charge of —4 x 10~? C? (b) What force 
would an electron experience if it were placed at that 
point? 


14.2. Two equal and opposite charges, q1 = 3 x 
10~° Cand qz = —3 x 107°C, are held 10 cm apart. 
(a) What is the electric field at the midpoint of the 
line joining the two charges? (b) What force would 


an electron experience if it were placed there? FIGURE 14-9 q 
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14.3. The two charges of problem 14.2 are placed 
on the x axis as shown in Fig. 14-9. What is the 
electric field at a point on the y axis located at a 
distance y from the origin? 


le—5 cm—|=—5 cm | 


Problem 14.3. 


14.4 Two charges, gq, = 7 x 10°? Candq = —14 
x 10~°C, are placed on the x axis as shown in Fig. 
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14-10. (a) Find the points where the electric field is 

zero. (b) What is the magnitude and the direction 

of the electric field at point P with coordinates x = 0, 
y = 20cm? 

(Answer: (a) 36.2 cm to the left of q,, 

(b) 1.70 x 10* N/C, 6 = —28.8°.) 


FIGURE 14-10 
Problem 14.4. 


14.5 Consider the arrangement of charges shown 
in Fig. 14-11. What is the electric field at point A? 


@4 


FIGURE 14-11 
Problem 14.5. 


14.6 Four charges of equal magnitude are placed 
at the corners of a square as shown in Fig. 14-12. 
What is the electric field at the center of the square, 
point O? 


FIGURE 14-12 
Problems 14.6 25 1 
and 14.7. q. hax IO0-"C «@ 


14.7 Consider the charge configuration of problem 

14.6. (a) What is the electric field at point A? (b) 
What is the electric field at point B? 

(Answer: (a) 3.77 x 10° N/C directed toward gp, 

(b) 6.19 x 10° N/C directed toward point O.) 


14.8 Two large parallel plates are separated by a 
distance of 5 cm. The plates have equal but opposite 
charges that create an electric field in the region be- 
tween the plates. An a particle (q = 3.2 x 10°C, 
m = 6.68 X 10~*’ kg) is released from the positively 
charged plate, and it strikes the negatively charged 
plate 2 x 10° sec later. Assuming that the electric 
field between the plates is uniform and perpendic- 
ular to the plates, what is the strength of the electric 
field? 

(Answer: 522 N/C.) 


14.9 Anelectron is projected with an initial velocity 
v; = 3 X 10° m/sec in the x direction in the region 
between two oppositely charged plates (see Fig. 
14-13). By the time the electron leaves the region 
between the plates, it has undergone a vertical de- 
flection of 2 cm. Assume that the electric field be- 
tween the plates is uniform and perpendicular to the 
plates and that the electric field outside the region 
of the plates is zero. (a) What is the strength of the 
electric field between the plates? (b) At what point 
yf On a screen 1 m away from the plates will the 
electron land? 

(Answer: (a) 8.19 N/C, (b) 10 cm.) 


c. 
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FIGURE 14-13 Problem 14.9. 

14.10 A uniform electric field € = 500 N/C exists 
in the region between two oppositely charged plates 
(see Fig. 14-14). How much work is done in moving 
a charge q = 6 X 10 ©C from A to P with constant 


velocity (a) along path ABP, (b) along path ADP, 
(c) along the straight line path AP? 

(Answer: (a) 1.5 X 10~° J, (b) same as (a), 

(c) same as (a).) 


| FIGURE 14-14 
Problem 14.10. 


14.11 The electric field between two parallel plates 
is uniform and perpendicular to the plates. The po- 
tential difference between the plates is 100 V, and 
the separation between the plates is 1 cm. What is 
the strength of the electric field between the plates? 

(Answer: 104 N/C.) 


14.12 What is the potential difference between the 
plates in problem 14.8? 
(Answer: 18.3 V.) 


14.13 A charge g, = 3 X 107° C is brought from 
infinity to the origin of a set of coordinate axes. A 
second charge q2 = 2 X 10°-° Cis brought also from 
infinity to a point with coordinates x = 5.cm, y = 
0 cm. (a) How much work is done in bringing g,? 
(b) How much work is done in bringing g2? (c) What 
is the potential at x = 2.5 cm, y = 0 cm? (d) How 
much work is done in bringing an electron from 
infinity to the point x = 2.5cm, y = 0 cm after q, 
and q2 have been placed at the locations indicated 
above? 


14.14 (a) What is the potential at point P in Fig. 
14.9 of problem 14.3? (b) How much work must be 
done to move an electron from point P to the origin? 


14.15 (a) What is the potential at point O in prob- 
lem 14.6 (see Fig. 14.13)? (b) What is the potential 
energy of a charge g = 1 X 10°° C when it is placed 
at point O? (c) How: much work must be done in 
bringing q from infinity to point O? (d) How much 
work must be done to move g from point O to point 
A? 

(Answer: (a) 0 V, (b) 0 V, (c) 0 V, (d) 0 V.) 


14.16 Charged particles are accelerated through a 
potential difference of 250 V. What will be the kinetic 
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energy in eV if the particle is (a) an electron, (b) a 
proton, (c) an @ particle (g = + 2e), (d)a gold nucleus 
(q = +79e). e is the magnitude of the charge of the 
electron and is 1.6 x 10°” C. 


14.17 In a given vacuum tube, an electron is re- 
leased from the heated filament with zero velocity. 
It is attracted by the positive plate and arrives at the 
plate with a velocity of 4 x 10° m/sec. What is the 
voltage of the plate with respect to the filament? The 
mass of the electron is 9.1 x 107~*1 kg. 


14.18 Ana particle (g = +2e) is shot directly to- 
ward a gold nucleus (g' = +79e) with a kinetic en- 
ergy E, = 6 MeV (6 Xx 10° eV). How close does the 
a particle get to the gold nucleus? Assume that the 
gold nucleus remains stationary. 

(Answer: 3.79 X 10714 m.) 


14.19 A particle with charge q, = 4 x 10~° C is 
held fixed at some point in space. A second particle 
of mass 20 g and charge q. = —5 X 107° Cis placed 
3.cm away from the first particle. What velocity must 
be given to q2 so that it will reach infinity with zero 
velocity? 


14.20 Two protons are held fixed 10 cm apart. A 
third proton is projected from far away with some 
initial velocity v; as shown in Fig. 14-15. (a) What is 
the minimum value of v; that will allow the proton 
to reach the midpoint of the line joining the two 
fixed protons? (b) If the initial velocity is half the 
value found in part (a), how close to point 0 will the 
proton get before it stops? 

(Answer: (a) 3.32 m/sec, (b) 1.94 x 107? m.) 


= —' rT | = | = 


FIGURE 14-15 —_e—- ——iT CCU 
Problem 14.20. 
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14.21 An electron is placed midway between two 
fixed charges, gq, = 2.5 x 10719 C and gq. = 5 xX 
10~*° C. If the charges are 1 m apart, what is the 
velocity of the electron when it reaches a point 
10 cm from q2? 


14.22. Two particles are placed 1 m apart. Particle 
1 has a mass m, = 20 g and a charge q,; = 6 X 
10~°C. Particle 2 has a mass mz = 50 g and a charge 
q2 = —4 X 10°°C. The particles are released from 
rest simultaneously. (a) What will be the velocities 
of the particles when they are 0.5 m apart? (b) At 
what distance from the initial position of particle 1 
will the collision occur? 
(Answer: (a) ¥; = 3.93 m/sec, v2 = 1.57 m/sec, 
(b) 0.714 m.) 


14.23 An electric dipole consists of two charged 
particles of mass m = 300 g and charge qg,; = 3 x 
10~° Cand q2 = —3 x 107° C connected by a rigid 
rod of negligible weight and length d = 20 cm. The 
dipole is placed in a region where there is a uniform 


electric field € = 5000 N/C (see Fig. 14-16). (a) What 
is the torque exerted by the electric field when 6 = 
30°? (b) How much work must be done to rotate the 
dipole from the angular position 6 = 0° to 6 = 90°? 
(c) If the dipole is pivoted about its midpoint and is 
released from the angular position 0 = 90°, what 
will be the angular velocity of the dipole when it 
swings back to 6 = 0°? 
(Answer: (a) 1.5 X 107-2 N-m, (b) 3 x 1077 J, 
(c) 3.16 rad/sec.) 


FIGURE 14-16 
Problem 14.23. 
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15.1 INTRODUCTION 


In this chapter we consider the motion of electrons in a conductor (a metal) 
when there is a voltage difference applied between the ends of the conductor. 
In more common language, how will the electrons in a metal wire behave if 
the wire is attached to the two opposite terminals of a battery whose potential 
difference creates an electric field in the wire? After consideration of the basic 
behavior, we will treat combinations of wires, batteries, and electrical mea- 
suring instruments. It will be necessary to understand the rules for charge 
flow, a current, in circuits in order to follow the arrangement of logic circuits 
in a computer, which will be treated in Chapter 27. We will limit our discussion 
mostly to direct currents, that is, currents whose magnitude and direction do 
not change with time. 


15.2 MOTION OF CHARGES IN AN ELECTRIC FIELD 


We have seen in Chapter 14 that the definition of electric field strength @ is 
the force per unit positive charge 


F 
6€=- 
q 
We may substitute Newton’s second law F = ma 
e=— 
q 
or 
gé 
= — 15.1 
a= 7 (15.1) 
Note that in Eq. 15.14 represents an arbitrary charge. In the case of an electron, 
gq = e, where e = —1.6 X 10°” C and the mass of an electron is m = 
9.1 x 10°°%! kg. Equation 15.1 will now be written as 
— lelé 
= ana 15.2 
a=— (15.2) 


in which the negative sign tells us that the direction of acceleration of an 
electron is opposite to that of the field direction. Unless there is a specific 
need to know the direction of motion, we may just use the magnitude of the 
acceleration in Eq. 15.2. For example, suppose a constant electric field is 
suddenly applied to a metal. What velocity will the electrons have after trav- 
eling a distance s, assuming that no scattering (or collisions) occurs over that 
distance? This is simply a problem from Chapter 3 in which we wish to find 
a final velocity when the initial velocity is zero and the displacement and the 


André Mane Am 
1836). 


pére (1775- 


constant acceleration are known. From Eq. 3.11, we write 


So, = 20x 
When 
Vo = 0, x=s8 
and 
e% 
. = — 
m 
the velocity is 
2eés 
v= \{/—— 
m 


Note that in this expression for v, e stands for the magnitude of the charge 
of the electron. 


15.3 ELECTRIC CURRENT 


In the preceding section we saw that an electric field in a conductor can cause 
charges to undergo accelerated motion. This acceleration is terminated by 
collisions of the electron with the atoms in the conductor. That is, the motion 
of an electron in an electric field is a series of short accelerations interrupted 
by collisions that scatter the electron. It therefore has a random path, although 
there is a slow net velocity opposite to the field direction, such as illustrated 
schematically in Fig. 15-1. It is the net velocity of the electrons, called the drift 
velocity, that gives rise to the current, not the brief accelerations. 

If we stand at a particular plane perpendicular to a wire and count the 
charge Ag that flows by in time At we define this as electric current 1, where 


Aq 


ie C/sec 


i= (15.3) 
The definition of the electric current given by Eq. 15.3 holds only if the 
rate of charge flow is constant. In the general case where 7 is not constant, 


we define it as 


44 


i= dt (15.3’) 


The concept of electric current is similar to the concept of measuring the 
current in a river. One measures the quantity of water, in gallons, cubic feet, 
or such, which flows past a point in a given time. 

Because the charge Aq is a scalar quantity, the current 7 is also a scalar. 
In the SI units, current is measured in amperes, or amps, after André Ampere 
(1775-1836). One ampere (1 A) is equal to one coulomb per second and is a 
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FIGURE 15-1 


The random path (black lines) of 
an electron resulting from colli- 
sions with the ions and the effect 
of an electric field on the path, 
with a resulting drift velocity 
(colored lines). [Source: David 
Halliday and Robert Resnick, 
Fundamentals of Physics, 2nd ed. 
Copyright © by John Wiley & 
Sons, Inc. Reprinted by permis- 
sion of John Wiley & Sons, Inc.] 
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relatively large quantity. Usually in electronic circuits the current is much 
smaller than 1 A, and we use the milliampere (1 mA = 10-3 A) or the 
microampere (1 wA = 10~° A) 

Consider now a cylindrical conductor with a cross-sectional area A as in 
Fig. 15-2, and let us assume that there are both positive and negative charges, 
both of which are mobile in the presence of an electric field @ with a vector 
direction from left to right. Let us further assume that there are N, positive 
charges per unit volume with drift velocity of v, and N,, negative charges with 
drift velocity of v,,. In time At the positive charges will move from left to right 
a distance of v,At. Therefore, in time At, all the positively charged particles 
within the shaded region of the cylinder of cross-section A and length v,At, 
and only those particles, will flow out of the shaded region of the cylinder 
to the right. The volume of the shaded cylinder is Av,At, and the number of 
positive particles within is the number per unit volume N, times the volume 
or N,Av,At; if each has a charge q,, the charge flowing across the right end 
of the cylinder is 


Aqp = Ip Np A %, At 


Substituting this into Eq. 15.3 gives the current resulting from the positively 
charged particles as 


; _ Atk 
PAE 
_ wNp A U, At 
At 
iy = Jp Np AV, (15.4) 


In the same way, the negative particles, each with charge qn, flow from right 
to left giving rise to a current 


in = Qn Nn A Un (15.5) 


It is seen in Eq. 15.4 that the current i, of positive charges is to the right 
in Fig. 15-2 because both the sign of the charge qp and their drift velocity are 
positive and, hence, their product is positive. The current i, resulting from 
negative charge motion also results in an effective current of positive charges 
to the right by subtraction of the negative charges. This is seen in Eq. 15.5 
in which both the sign of the charge q, and the sign of the drift velocity 2, 
are negative and therefore their product is positive: A flow of negative charges 
to the left is equivalent to a flow of positive charges to the right. When both positive 
and negative charges move, the total current i is the sum of these two currents 
or : 


1= 1, + ly 


= A(qp Np Up + Gn Nn Un) (15.6) 


be 


FIGURE 15-2 


1 = Ady Np Vy + Gy Ny Un) 


It is seen by Eq. 15.6 that a measure of electric current does not tell whether 
the current is being carried by positive or by negative charges or by both. 

The vector drift velocity of the positive charge carriers v, is in the same 
vector direction as that of the electric field vector €. The direct current i in a 
conductor has the same direction as that of the electric field ©. 

We note that as much charge flows into the cylinder of Fig. 15-2 as flows 
out. There is no pileup of electric charges in the wire at any point. If there 
were, then the local electric field would be stronger at that point which would 
increase the net flow of charge past that point until the charge density at 
every point in the wire would again be equal. If we connect a wire between 
the terminals of a battery, it is therefore reasonable to conclude that charge 
flows at a steady rate throughout the wire. 

It is often convenient to introduce a quantity called the current density, 
with symbol J, which is the current per unit cross-sectional area. This elim- 
inates the area A from Eq. 15.6; thus 

J = = Alm? (amp/m?) (15.7) 


Example 15-1 

Suppose a copper wire carries 10 A (amps) of current and has a cross-section 
of 10~° m’. As will be seen later, each atom of copper contributes one electron 
that is free to move, so the electron carrier density N,, is about the same as 
the density of atoms, which is about 7 x 10°° atoms per m? (see problem 
9.2). The charge on an electron is —1.6 x 10°’ C. (a) What is the drift velocity 
UV, Of the electrons? (b) How long would it take an electron to move from one 
terminal of a battery to the other if this wire were 1 m long? 


Solution 
(a) 1 = Ag, Ny Vp 


1 
A Gn Nn 


10 A 
10°-° m? X 1.6 X 107°'°C x 7 X 10282 m-3 


9 x 1074 m/sec 


x 1m 


— oN see 1.1 x 10° sec = 18 min 


(b) t = 


So the actual drift velocity of a given electron is very small. However, when 
we turn on a light switch, the lamp will light almost immediately regardless 
of the distance from switch to lamp. The reason is that the speed of propa- 
gation of the electric field along the wire is that of the speed of light in the 
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wire. Therefore, all electrons are acted on almost simultaneously by the electric 
field and begin to drift. 


15.4 RESISTANCE AND RESISTIVITY 


We have seen that the electric field @ is the force per unit charge and, hence, 
the field causes the charges to be accelerated. The collisions with atoms scatter 
the charges, which are then accelerated again. This accelerating-scattering 
process gives rise to a net drift velocity, resulting in an electrical current in 
the direction of the field 6. Experiment shows that in many cases the electric 
current i, hence the current density J, are proportional to @. 


Jx@ 


We can change this proportionality to an equality by introducing a quantity 
p, called the electrical resistivity 


6 =p] (15.8) 


This resistivity is a property of a given material and is independent of its 
shape. The resistivity was found to be a constant for a given metal at a given 
temperature by George Ohm (1789-1854); Eq. 15.8 is called Ohm’s law. A 
material obeying Ohm’s law is called an ohmic conductor, that is, one with a 
linear relation between current density and electric field. A material with a 
nonlinear relationship is called a nonohmic conductor. For example, in Chapter 
26, we will see that a linear dependence does not hold in the case of a circuit 
element called the diode. 
From Eq. 15.8 the units of p may be determined 


_ (NC) _ & a) 
— J(C/sec-m?) J Ce 
This is such a cumbersome unit that it is shortened to Q-m (ohm meter). 
Some typical values of p for conductors and insulators are given in 
Table 15-1. 


TABLE 15-1 
Material p (Q-m) at room temperature 
Silver 1.5 x 1078 
Copper 1.7 x 1078 
Aluminum 2.7 X 1078 
Glass od | Oe 
Teflon 107" 
Dry wood a ae 


It is seen that differences in resistivity between insulators and conductors 
can be as great as 22 orders of magnitude (powers of 10). One of the early 


George Ohm (1787-1854). 
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successes of the theory of solids was to explain this. We will develop this 
theory in Chapters 23 and 24. 

It is sometimes convenient to use the reciprocal of the resistivity; this is 
called the conductivity and has the symbol o. Equation 15.8 may be written 
as 


jl = 0% (15.9) | = o€ 


where o = 1/p. 

Suppose we have a given metal wire with cross section A, length 1, and 
resistivity p with an applied electric field 6. The wire is shown schematically 
in Fig. 15-3. We can use Eq. 14.19 to relate the electric field inside the conductor 
to the potential difference between the two ends of the conductor, points 1 


and 2. 
AV =V,-—V> = [°8-as (14.19) 
If the electric field inside the conductor is uniform, the integral becomes €I, 
and 
AV = €l 

where 

l= S —- $1 
or 

AV 
¢=-— (15.10) 


Substitution of Eq. 15.10 for € and Eq. 15.7 for J in Eq. 15.8 yields 


which is written | 
V = iR (15.11) V = iR 


where V actually means AV or voltage difference between the two ends of 
the wire. This equation (15.11) is also commonly called Ohm’s law. In this 
equation R = pl/A and is called the resistance of the wire and has units of 0 
(ohms). It is seen that the longer the wire the more resistance it has to the 
current, but the larger its cross-sectional area, the less resistance it has. The 
analogy to water flowing through a pipe is useful. Voltage will be the equiv- 
alent of the difference in water pressure and current that of volume per second = 
of flowing water. $} 

We should note an important fact about the direction of the current 
through a resistance. At the beginning of this chapter, Section 15.3, we in- : 
dicated that the current has the same direction as that of the electric field. In Vv; | V2 
Chapter 14, Section 14.4, we demonstrated that the electric field is directed FIGURE 15-3 
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from high potential points to low potential points. We therefore conclude 
that the current in a resistance is from its high potential side to its low potential side. 


15.5 RESISTANCES IN SERIES AND PARALLEL 


In this and succeeding sections we will study some simple electrical circuits. 
Resistances will be drawn with the symbol—”/— and emf sources will be 


drawn as |-for a small emf and- [|| H-for a larger emf. In all cases, the large 


line represents the positive, or higher potential, side of the emf. A simple 
direct current (dc) circuit with one emf source and one resistance is drawn 
as in Fig. 15-4, where the arrow represents the direction of the current. 

In Fig. 15-4 and in the following circuit diagrams the emf sources are 
labeled with the letter V and the voltage magnitude is given. This is the 
conventional labeling, but it does not conform well to the definitions. If one 
measures the potential difference between the terminals of a battery, this is 
AV but, as already mentioned, it is customary to refer to a voltage difference 
simply as the ‘voltage.’’ We have also seen that the accepted name for such 
a voltage source is emf. Confusion in terminology often arises for the student 
when the form of Ohm’s law of Eq. 15.11 is used. The V in this equation is 
the potential, or voltage drop across a resistance R when a current i passes 
through it. If one takes a meter that measures electric potential difference, 
called a voltmeter, it will read a voltage V when the probes are placed on 
opposite sides of the resistance. The concept of this voltage difference, com- 
monly called a voltage drop, across a resistance should not be confused with 
the potential difference across the emf source. 

Suppose that we replace the single resistance of Fig. 15-4 with three re- 
sistances (called resistors) of different values R,, R2, and Rs, as in Fig. 15-5a. 
We assume in this type of calculation that the connecting wires have zero 
resistance. Therefore, the electric potential at point A is the same as that at 
the left side of the battery, and that at point D is the same as the right side 
of the battery. The same current must pass through each of these resistances 
as that which passes between points A and D. This combination is therefore 
called series resistances because the current passes through each sequentially. 
By Ohm’s law (Eq. 15.11) we may write the voltage drop across each resistance as 


Vas = iR,, Vec = iR>, Veo = IRs 


and, because the sum of these voltage drops must equal the potential differ- 
ence between A and D, Vap, which is the emf of the battery, we conclude 
that 

V = Vas +t Vac + Veo 


= 1Ry a 1R> = 1R3 = (Ry i R, ae R3) 
V = iReg 


=a 
{| | 
FIGURE 15-4 ; 


(i?) 
y 
FIGURE 15-5 
(a) Three resistors in series. (b) 
Equivalent resistor R... 
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where R,, is the equivalent resistance of the three. That is, the same current 
would flow through the circuit if the three resistances were replaced by a 
single one (Fig. 15-5b) with magnitude 


Rag = Ry Si R, a R; (series) (15.12) 


It is obvious that the generalization of Eq. 15.12, namely, R.g = 2 Ri, will 
be true regardless of the number of resistances in series. i 


Example 15-2 


Suppose in Fig. 15-5 the voltage V = 1.5 V and the resistances are R; = 5), 
R, = 10 0, and R; = 15 2. What are the voltages Vas, Vpc, and Vcp? 


solution First we find the current through the resistors by replacing the 
individual resistances with a simple equivalent resistance. 


V =i1R.g = 1 (Ri + Ro + Rs) 


15V 
fee 0 Ko 
'= 6 +10 + 15) 0 = 


Then, applying Ohm’s law to each resistance 


Vac = iR, = 0.05 A X 100 = 0.50 V 
= 0.05A X 150 = 0.75 V 


Voep = IR; 
| sum = Vap = 1.5 V 


Suppose we now arrange these resistances in parallel, as in Fig. 15-6a. As 
stated previously, we assume the resistance of connecting wires to be neg- 
ligible and, for clarity, we redraw the circuit of Fig. 15-6a in the form of Fig. 
15-6b. Because all connecting wires are considered to have zero resistance, 
there can be no voltage drop across them. Therefore, the left side of each 
resistance is at the same potential and the right side is at the same potential; 
hence, the same voltage drop V must occur across each. We further note that 
although the current through each resistance may be different, the sum of 
the individual currents must equal the current that flows through the wire 
connecting them to the battery because charge must be conserved. Thus, 


i=i, +i +i, (parallel) (15.13) 


Because each resistance has the same voltage drop V across it, we may write 
Ohm’s law for each 
V = 11K, V — 1oR>, V — 13R3 


and 


Req — K, + ie r R-  .. 


FIGURE 15-6 
(a) Three resistors in parallel. (b) 


Conventional form of drawing 
resistors in parallel. (c) as a 
lent resistor R.. 
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Substituting these into Eq. 15.13 gives 


viv UY 
Ri, Ro Rg 
1 1 1 
=V(—+—4+= 
+e re 
or 
1 
 heesa 
R, R> R3 
1 ‘ 
~ TP Res 
Reg 
where 
1 1 1 1 
4 (parallel) (15.14) 


=—+—+ 
Req Ri Ro Rs 


We see that any number of parallel resistors can be replaced with an equivalent 
resistor by a generalization of the relation of Eq. 15.14. The equivalent circuit 
looks like Fig. 15-6c, where R,, is given by Eq. 15.14. 


Example 15-3 
Suppose two resistors, R; = 5 Q and R2 = 10 Q, are connected in parallel to 
a 1.5-V battery as in Fig. 15-7a. (a) What is the current through each? (b) What 


is the total current in the circuit? 


Solution 
(a) Using Ohm’s law (Eq. 15.11) 
V= 11R1, V= IoR> 


V 15 V 
=— = —— = 03A = 300mA, 
1 R 50 - 
V 1.5V 
= — = —— =0,15A = 150mA 
2" R 100 - 


(b) i = 4) + 2 = 300 mA + 150 mA = 450 mA 


We may check this answer by solving the equivalent circuit, Fig. 15-7b. 


1 1 1 1 1 
=— + — = —— + — = 0.20'1'4+010'1=0307! 
Req Ri Ro 50 100 


V= 1L5V 
FIGURE 15-7 
Example 15-3. 
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or 
R, = —— = 3.339 
“4 0.3071 ~ 
Vo 15V 
p= — = ——— = 045A = 450 mA 
‘Ro 3.330 “_ 


Example 15-4 


Three resistors are connected in a combination of series and parallel as in Fig. 
15-82. What is the current through each? 


Solution First we find Req) for the parallel combination 


a oe oe a 


; | 
= + + — = 0.507! + 0.25071 = 0.7507! 
eq(p) Ri Ro 20 4 


Reap) = 1.33 Q 


R 


We then have the equivalent circuit, Fig. 15-8b. We now find the equivalent 
series resistance Reqs) 


Reqs) = Reap) = R3 = 1.33 Q+30 = 4.330 


We now have the simpler equivalent circuit of Fig. 15-8c. The current is given 
by Ohm’s law 


p= —— = —* = 035A = 350mA 
Regs) 4.33 0 a 


We may return to Fig. 15-8b and note that we have already solved part of the 
problem because all this current flows through Rs, hence, 13 = 350 mA. We 
may find the voltage drop across the parallel combination by use of Ohm’s 
law | 


Vip) = iReqip) = 350 mA X 1.33.0 = 0.47 V 


The current through each of the parallel resistors is then 


Vie. 0.47V 

- Vp 37 V _ 99035 A = 235 mA 

1 Ry 20 a 
Voy _ 9-47 V _ yo iig A = 118 mA 


1 = = 
> Rs 4a, 
And, except for the rounding-off error, 1; + i. = Iz = 1 
You will have noticed that when there are two resistors in series or in 


parallel, a method of ratios exists as a quicker way of solution. Consider first 
a series circuit as in Fig. 15-9. The current through R, is the same as that 


(c) 
FIGURE 15-8 


FIGURE 15-8 
Example 15-4. 


FIGURE 15-9 
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through R2, and by Ohm’s law 
j2M oY 
Ry’ R, 


where V, and V, are the voltage drops across R, and Rz, respectively. Equating 
the 7’s gives 


Le peee: 
Ri, R 
or 
V, ®R | 
V, = R (series) (15.15) 


so that in a series circuit the ratio of the voltage drops is equal to the ratio of the 
resistances. 

A different ratio can be written for parallel resistors (see Fig. 15-10). In 
this situation we recall that the voltage across each is the same. From Ohm’s 
law 


Vi = Ri, Vo = bR2 
Equating V; and V2 gives 


1,R, = IoR> 
or 
1 R 
+ <2 (parallel) (15.16) 
lo Ry 


where we see that in a parallel circuit the ratio of the currents through each resistor 
is inversely proportional to the resistances. We might have expected this result 
from the understanding that resistance impedes the flow of current; hence, 
the larger the resistance the lower the current. 


15.6 KIRCHHOFF’S RULES 


Not all electrical circuits can be reduced to simple series or parallel combi- 
nations. Two fundamental rules were established by G. R. Kirchhoff (1824-1887) 
that aid in the solution of electrical networks. 


1. The algebraic sum of currents toward any branch point is zero. 
2. The algebraic sum of all potential changes in a closed loop is zero. 


We will consider rule (1) first. As we have stated earlier, charge cannot 
accumulate (or deplete) in a dc circuit: If it did, there would be a larger (or 
smaller) electric field at that region which would exert a larger (or smaller) 


FIGURE 15-10 


Gustav R. Kirchhoff (1824-1887). 


force and thereby redistribute the charge evenly. Therefore, at any branch 
point in a circuit, whatever charge flows in must flow out. This is seen in 
Figs. 15-11a and b. An analogy would be a series of hoses for water irrigation. 
At a branch of hoses, whatever water flows in must flow out. 

The current equation for Fig. 15-11a, 1; = ig + is, is an obvious application 
of the first rule. The rule can also be used for the circuit of Fig. 15-11b. Consider 
branch point A. The current equation would be i, = 13 + 1, if the currents 
have the directions indicated by the arrows. We do know that current iz has 
the direction indicated by the arrow. But we do not know if the arrow is the 
correct direction for i; since we do not know if the potential at point A is 
higher or lower than that at point B. We therefore assume a direction and 
maintain that assumption in formulating other equations. When we finally 
solve the circuit equations, if 7, is positive our assumption is correct; if it is 
negative, then the current 73 is in the direction opposite to our assumption. 
To illustrate the consistency of following the original assumption, rule (1) 
applied to branch point B would be t + i3 = 1s. 

Let us consider rule (2). This rule is a statement of the conservation of 
energy. In a circuit there may be potential differences associated with emf 
sources present as well as voltage drops associated with resistors. If we men- 
tally start at a point in the circuit, go around any closed loop in either direction 
adding algebraically all the changes in potential and then return to the starting 
point, the potential of that point must be the same as when we started; that 
is, the sum of all the potential changes (increases and decreases) considered 
in our mental trip must add up to zero. 

In applying rule (2), it is useful to follow certain guidelines that will 
prevent errors in the signs of the potential changes. 


(a) As indicated in connection with rule (1), we first assume a direction for 
the current through each branch of the circuit. 


(b) We then choose any closed loop in the circuit and designate the direction 
(clockwise or counterclockwise) in which we wish to mentally traverse it. 


(c) We now go around the loop in the chosen direction adding ape unacary 
all the potential changes and setting the sum equal to zero. 


When we meet an emf source, its voltage V is taken as positive if we cross 
the source from the negative (low potential) side to the positive (high poten- 
tial) side. The reason for taking V as positive is that in going from the negative 
side of the source to the positive one, the change in potential represents an 
increase in electric potential. If the source is crossed from positive to negative, 
its voltage is taken as negative because the electric potential has decreased. 
Let us now consider what to do when we meet a resistor. Earlier, we indicated 
that in a resistor the current goes from the high potential side of the resistor 
to the low potential one and that the potential drop between the two sides 
is iR. Thus, if in our mental trip around the circuit loop we cross a resistor 
in the same direction as the current, we must take the iR drop as negative 
because we are going from high to low potential—a decrease. The iR drop is 


— — 
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(a) 


FIGURE 15-11 
Kirchhoff’s rule for current at a 


branch point. 
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taken as positive if the resistor is traversed in the direction opposite to that 
of the current. 

Now we can apply rule (2) to some simple circuits. Consider the circuit 
of Fig. 15-12. We choose the current in the clockwise direction (we may just 
as well choose the opposite direction, although the correctness of our as- 
sumption is obvious by inspection in this simple circuit). If we now traverse 
the loop in the clockwise direction starting at point A, we apply rule (2) and 
write 


Note that both iR drops are written as negative because both resistors were 
crossed in the direction of the assumed current. V was taken as positive 
because the emf source was crossed from the negative to the positive side. 
We can rewrite the result as 


V = 1R, + 1R> 
which is the result obtained when we discussed resistors in series where rule 
(2) was implicitly used. 
Let us now consider the slightly more complicated circuit of Fig. 15-132. 


We may again choose to go around the loop in a clockwise direction starting 
at point A. From rule (2) we obtain 


—1iR, — 1Ry + V2 + Vi = 


| 
oS 


or 
V> + Vi a iR, “F 1R> 


We see that because the batteries are pointing in the same direction (relative 
to the direction of the current) the effective voltage of the two emf sources 
is the sum of the individual voltages, that is, an emf source of voltage V = 
V, + V2 would give rise to the same current. Suppose we reverse the direction 
of one of the sources, as in Fig. 15-13b. We will still assume that the current 
is in the clockwise direction. With such an assumption, and if we traverse 
the loop in the clockwise direction, we write 


—iR, — 1Ro + Vo — V; = 0 


Note that V; is now taken as negative because in our mental trip the emf V;, 
was crossed from the positive to the negative side. The result can be solved 
for i 


. V2—-V; 
' R, + Ro 
It is clear that if V; > V2, i would be negative, indicating that we have assumed 
the wrong direction for i, that is, i would be counterclockwise. 

Let us now use Kirchhoff’s rules to solve a circuit with certain similarities 
to a transistor circuit that we will encounter in a later chapter. 


FIGURE 15-12 


(a) 


(b) 
FIGURE 15-13 


Example 15-5 


In the circuit of Fig. 15-14, (a) Find the currents ic, ig, and ig and the voltage 
drop across resistors R,; and R>. (b) Find the voltage difference between points 
C and D and between D and E. 


Solution 
(a) From the first rule at branch point B 
ic + ip = if 
We then write the second rule for the two loops. For the right-hand 


loop, if we traverse it in the counterclockwise direction starting at 
point D, we write 


V> IcR> — IicRe = inks = 0 
9V — ic 5000 — ic 1000 2 + ip 60 0 = 0 
9V — ic 15000 + 7260 = 0 


For the left-hand loop, traversing it counterclockwise, we write 
V, — igRg — eRe — tek, = 0 
1.5V — ip 600 — 7.1000 — 500 = 0 
1.5V — ig 600 — it, 1500 = 0 


We now have three equations to be solved simultaneously for ic, ig, 
and ig. They are 
Ic a ip —_ Ip 


0 


9V — ic 15000 + tg 60 2 
1.5V — ip 600 — % 1500 = 0 


We can use the first equation to eliminate ig from the last two. ig = 
ip — ic, therefore 


9V — ic 15000 + (ig — ic) 600 = 


| 
oO 


or 
9V — ic 15600 + 600 = 0 (15.17) 
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FIGURE 15-14 
Example 15-5. 
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(b 


ee” 


1B 


and 


15. V — (ig — ic) 600 — ig 1500 


| 
S 


or 
1.5V — % 2100 + ic 600 = 0 (15.18) 


Equations 15.17 and 15.18 can be solved for ig and ic. Multiplying 
Eq. 15.17 by 7 and Eq. 15.18 by 2 and adding them together eliminates 
iz, that is, 


7X 9V — ic 1560 + & 600) = 0 
2X (1.5 V — & 2100 + 1¢ 600) = 0 
63 V — ic 10,9200 + 3V 4+ ic 1200 = 0 


es 66 V 
~~ 10,800 Q 


= 6.1 x 10°°A =61mA 


We can now solve for ig using either Eq. 15.17 or 15.18 


15 V 4+ ic 600 
: 210 0 


1.5 V + (6.1 x 10-3 A)(60 Q) 
210 9 


8.9 x 10°° A = 8.9mA 


Finally, we can use the result of the first rule, ic + ig = ig, to obtain 


| Ip = Ip — Ic = 8.9mA — 6.1mMA = 2.8mA. 

The voltage drop across R, is | 
V = igR, = 8.9 X 1072 A X 500 = 0.45V_ 

and the voltage drop across Rz is 

V = icR, = 6.1 X 10°° A X 5000 = 3.1 V 


To find the voltage difference between points D and C, let us follow 
the current ic through the circuit elements in the right-hand loop. 
Start with the potential Vp. 


Vp + V2 — tcR2 = Ve 

Vo — Vp = V2 — icRo 
9V —6.1 x 10°°A x 5000 
6V 
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We may do the same with the left-hand loop to find the potential 
difference between points D and E. 


Vp — ipRp — tpRe = Ve 

Ve — Vp = —ipRg — ipRe 
-2.8 x 10°°A x 600 
- 8.9 x 10°°A x 1000 
—-1.1V 


The result shows that D is at a higher potential than E. 


15.7 AMMETERS AND VOLTMETERS 


We will see in the next chapter that electric current passing through a wire 
produces a magnetic field. If a loop of wire is used then, on the passage 
of current, one end of the loop becomes the north pole of a magnet and the 
other end becomes the south pole, as in Fig. 15-15a. This will be discussed 
in more detail in Chapter 16. Many loops can be used (see Fig. 15-15b), and 
each one contributes to the forming of a magnet: The larger the number of 
loops, the stronger the magnet for a given current. A series of loops forms a 
coil, and if a tightly wound coil is placed between the poles of a permanent 
horseshoe magnet as in Fig. 15-15c and current passes through it, the induced 
north pole of the coil will be repelled by the north pole of the permanent 
magnet. If the coil is suspended by a flexible metal strip (see Fig. 15-15c), the 
twisting (torsion) force of this metal strip acts as a spring and will oppose the 
rotation of the coil, causing it to return to its initial position at zero current. 
Thus, depending on the strength of this metal strip and the other design 
parameters of the mechanism, a full-scale deflection of the instrument needle 
can be established for a given amount of current through the coil. This in- 
strument is called a galvanometer. The current for full-scale deflection is called 
the current rating of a meter. Because these instruments are electrically and 
mechanically delicate, a common current rating is 0.1 mA (10~* A). 

Used by itself, such a meter could only measure currents from 0 to 
0.1 mA when placed.in series in the circuit. To extend the range of the meter, 
a lower resistance, called a shunt, is placed in parallel with the meter. Figure 
15-16 shows these situations in which the meter could measure full scale for 
different currents in the line. Meters used for the measurement of current 
though a circuit are known by the general name of ammeters (meters to measure 
amps of current). The resistance of the coil R, in the galvanometer is also 
specified by the manufacturer, in addition to the current rating. A typical 
value might be 1000 2, or one kilohm, kQ. From Ohm’s law the voltage drop 


FIGURE 15-15 
The basis of a galvanometer. 
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across the galvanometer in all cases of Fig. 15-16 must be 
V = iR, = 10-*A X 102 =01V 


and this must also be the voltage drop across the shunt because it is in parallel 
with the meter. With the use of Ohm’s law we may calculate the value of the 
shunt resistance. In Fig. 15-16b 


In Fig. 15-16c 


Many test meters have an external switch that changes the scale of the am- 
meter. This switch disconnects one shunt and introduces one of a different 
resistance into the circuit so that the same meter is used for different ranges 
of current. 

An instrument to measure the voltage difference between two points in 
a circuit, say, two sides of a resistor, is called a voltmeter and can be made 
from a similar galvanometer. However, we do not want such a voltmeter to 
disturb the current flow through the resistor because such a change would 
alter the 1R voltage drop. The ideal instrument would be one that had infinite 
resistance. However, the galvanometer requires that current pass through it 
to obtain a measurement. We will continue to use the galvanometer previously 
discussed, which requires 100 uA for full-scale deflection and has an internal 
resistance of 1 kO (1000 9). As shown before, this meter will read full scale 
when the voltage difference across itis V = 10°*A x 10° = 0.1 V. Suppose 
we wish to extend the range of the galvanometer to 10 V full scale. We would 
connect a resistor R2 in series with the galvanometer so that the potential 
drop across the galvanometer is still 0.1 V, as in Fig. 15-17. To find the value 
of Rz, we note that the voltage drop across the galvanometer V,, = 0.1 V, 
plus the voltage drop across R2, iR2, must be 10 V, that is, 


0.1V+107-*A xX R, =10V 

_ 99V 

*~ 10-4 A 

Similarly, if the meter is to read full scale across a voltage drop of 100 V, the 
drop across R2 must be 100 — 0.1 = 99.9 V and the value of R, must be 

_ 9.9V 

~ 10-4A 

Again, an external switch on a meter connects different values of series re- 


sistances so that full-scale deflection of the meter may indicate different max- 
imum voltages. 


=99 x 10°02 


Rp = 9.99 x 10° 0 


FIGURE 15-16 
The construction of different am- 
meters from a galvanometer. 
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FIGURE 15-17 

The construction of a voltmeter 
(enclosed by the dashed semicir- 
cle) from a galvanometer. 
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15.8 POWER DISSIPATION BY RESISTORS 


We saw at the end of Chapter 6 that in an elastic collision between an electron 
and an atom, very little energy is transferred to the atom—most of the kinetic 
energy is retained by the electron in its recoil (bouncing off the atom). 
However small, some energy is lost by the electron to the atom. This is the 
situation in a metal when electrons, accelerated by the electric field, collide 
with the atoms. Because many collisions are taking place, each small energy 
loss adds to a considerable amount. The kinetic energy transferred to the 
atoms per unit time represents an energy loss per unit time by the electrons, 
which is a power loss. We have seen in Chapter 9 that temperature is a 
measure of the average kinetic energy of the atoms (or molecules) of a system. 
Therefore, we expect any conductor to heat up when an electric current is 
passed through it. We see this phenomenon daily in electric heaters, ovens, 
and light bulbs. 

The calculation of power dissipation P in an electrical resistor R as a result 
of the passage of a current i can immediately be found by considering Fig. 
15-18. Let V, and Vx represent the potentials of points A and B, respectively, 
and Vas the potential difference. The change in potential energy of a charge 
Ag entering at A and leaving at B is 


AE, = Aq (Vg — Va) 


This represents an energy loss because V4 is greater than Vz. In a given time 
At the amount of charge involved is Ag = iAf, so 


AE, = VaB 1 At 
The power dissipated in the resistor is Paz = AE,/At, or 
| Pag = Vapi 


We have written the voltage with the subscript AB to denote that if there is 
more than one resistor in a circuit, the power lost in each is the product of 
the voltage across each and current flowing through each. With this point in 
mind, we may write in more general form 


P = Vi (15.19) 


Two other forms may be obtained by substitution of Ohm’s law, V = ik. 
These are 


= j2R (15.20) 


and 
(15.21) 


We have shown in Chapter 5 that the unit of power is the watt or J/sec. 
The consistency in electrical definitions is readily seen from Eq. 15.19. The 
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FIGURE 15-18 


Pp = # 


Pp = 
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unit of voltage is one joule per coulomb (one volt) and the unit of current is 
one coulomb per second (one ampere). Therefore the unit of power is 


P = Vi = (1JC~!) (1 C sec“*) = 1 J/sec = 1 W (watt) 


15.9 CHARGING A CAPACITOR—RC CIRCUITS 


Thus far we have limited our discussion to cases where the current is constant 
with time, that is, direct current. In this section we consider a circuit where 
the current varies with time. This circuit plays an important role in the op- 
eration of computer clocks, which will be presented in Chapter 27. 

In Chapter 14 (Section 14.7) we saw that when a capacitor is connected 
to the terminals of a battery, the plate of the capacitor connected to the positive 
side of the battery acquires a positive charge g = CV (Eq. 14.21) and the other 
plate an equal but negative charge —q. One question that we may ask is: 
How long does it take for the charges to appear on the plates of the capacitor? 
Obviously, because resistors determine the current (that is, the rate of charge 
flow) in the circuit, this will depend on the resistance that is in the circuit. 

Let us consider a circuit where a resistor KR (the resistance of the con- 
necting wires is assumed to be negligible) and a capacitor C are connected in 
series by means of switch S to a battery of emf V as in Fig. 15-19a. The initial 
condition is that when the switch is open there is no charge on the capacitor. 
When the switch is closed, a current is set up in the circuit and the capacitor 
will begin to charge (see Fig. 15-190). 

Let q be the charge on the capacitor at some time ft after the switch is 
closed and let i be the current through the resistor at the same instant. Vap 
is the voltage across the terminals of the battery, that is, Van = V, Vag is the 


voltage drop iK across the resistor, and Vgp is the potential difference ’ across 
the plates of the capacitor. We therefore write 
Vap = Vas + Vep 


Or 


V =iR + ‘ (15.22) 


By definition i is the rate at which charges flow through the resistor and, 
because these charges cannot cross the gap between the capacitor plates, this 
rate represents the rate at which the charge on the capacitor is increasing, 


d 
that is, 7 = mn . Equation 15.22 becomes 


(15.23) 


R 


FIGURE 15-19 
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Dividing both sides of Eq. 15.23 by R and rearranging terms we have 


Veg _& 
R RC dt 
Or 
_ dg 
dt = VG (15.24) 
RRC 


We multiply both sides of Eq. 15.24 by —1/RC and integrate with the limits 
gq = 0 when t = 0 and charge q is on the plates at time ft. 


1 
= [, at -[ me oy a 
~ RC 4 — 4 
R 


RC 


fio, (Y_ 4 
aa cama (j a) 


t  , (VY 4\_, (VY) | 
-gn= In a J) In 6 (15.25) 


Because In x — In y = In : , Eq. 15.25 becomes 


Taking the antilog of both sides we obtain 


oe or 
e = 1] CV 
Solving for q, we get 
g= CVC Aes) (15.26) 


Let us wanalyee Eq. 15.26. Att = 0,q = CV(1 — e-°) = CV (1 — 1) = 
0. This agrees with the fact that at t = 0 (when the switch was closed) the 
capacitor was uncharged. As ¢ increases, the exponential term in the paren- 
thesis decreases and consequently q increases (the capacitor is being charged). 
As t > ©, e “RC —» 0 and q — CV, the ultimate charge on the capacitor. 
Although it takes an infinite amount of time to fully charge the capacitor, it 
takes a finite amount of time to get very close to the final value q = CV. 
Moreover, this time is determined by the product RC, which is called the 
time constant of the circuit. For example, when t = RC,g=CV(1-e") = 
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0.63 CV; when t = 4 RC, q = CV (1 — e-*) = 0.98 CV. We see that after ; 
a few time constants, q is very close to its ultimate value. A plot of q versus 
t is shown in Fig. 15-20. CV 

The consistency of the electrical definitions of R and C can be readily 
verified in Eq. 15.26. From Ohm’s law, 


—_— ee eee ee ee 


V volts volts 


] 
| 
I 
I 
I 
| 
RC 4 RC f 


similarly from Eq. 14.21 FIGURE 15.20 
Cc Charge accumulation on a Ca- 
q ale eee 
C= >= pacitor in an RC circuit as a func- 
V volts 
tion of time. 
therefore 
volts C 
= seconds . 


~ C/sec volts 


The mathematical solution of the circuit equation shows that the larger 
the value of the resistor R and of the capacitor C, the longer it will take to 
charge it. It is not difficult to understand the physical reason for this result. 
The larger R, the smaller the current through the circuit at any one time, 
hence the smaller the rate at which the capacitor is being charged. Similarly, 
the larger C, the more charge it can store for a given voltage and, obviously, 


the longer it will take to charge it. 


PROBLEMS 


15.1 A wire carries a current i = 1 A. How many 
electrons pass a fixed cross section of the wire in 1 
sec? 


15.2 Copper has one conduction electron per atom, 
that is, each atom contributes one electron that is 
free to move through the solid. The density of cop- 
per is 9 g/cm? and its molecular weight is 64 g/mole. 
A wire carries a current of 10 A. The cross-sectional 
area of the wire is 3 mm. (a) What is the current 
density? (b) What is the number of conduction elec- 
trons per m°? (c) What is the drift velocity? 
(Answer: (a) 3.33 X 10° A/m2, (b) 8.47 x 10° m~?, 
(c) 2.46 x 1074 m/sec.) 


15.3. A copper wire 15 m long has 8 x 10*° mobile 
electrons. What is the drift velocity of the electrons 
if the current in the wire is 5 A? 


15.4 The resistivity of copper at ambient temper- 
ature is p = 1.7 X 107° O-m. What is the resistance 
of a copper wire 5 m long and 2 x 107° m in di- 
ameter? 


15.5 A copper wire (9 = 1.7 x 1078 O-m) 10 m 
long and 1 x 10~° m in diameter carries a current 
of 2 A. What is the potential difference across the 
ends of the wire? 


15.6 Inthe earth’s atmosphere positive charges move 
toward the earth and negative charges move away 
from it. The total current is approximately 1800 A. 
The average value of the electric field responsible 
for this current near the surface of the earth is 
100 N/C. What is the resistivity of the air at the sur- 
face of the earth? The radius of the earth is 6.37 x 


6 
10° m. (Answer: 2.83 x 10° Q-m.) 


15.7 In the circuit of Example 15-4, let R; = 5 0, 
Ry = 100, R3 = 4Q, and V = 2V. Find the currents 
hh, I>, and 13. 

15.8 Inthe circuit of Fig. 15-21 (a) Find the currents 


through each resistor. R; = 3 0, R. = 6Q, R3 = 
60, Rg = 12 0, V = 18 V. (b) What is the total 


current 1? 


FIGURE 15-21 
Problem 15.8. 


15.9 The current through R; in the circuit of Fig. 
15-22 is 0.2 A. (a) What is the current in R,, R2, and 
R4? (b) What is the voltage of the battery? 

(Answer: (a) 1.1 A, 0.5 A, 0.4 A, (b) 4.2 V.) 


— 


FIGURE 15-22 
Problem 15.9. 


15.10 How many possible resistance values can be 
obtained with three resistors R; = 500, Ro = 100Q, 
15.11 Inthe circuit of Fig. 15-23, find tp, the voltage 
drops across R, and R>, and the voltage differences 
Ve — Vo and Vo — Ve. 


¢ Re =402 Bg Ro = 4002 ¢ 


FIGURE 15-23 ) 
Problem 15.11. Vy 1.5V Vo 6 V 
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15.12 The current 7 through R, in the circuit dia- 
gram of Fig. 15-24 is 40 mA. (a) What is the current 
through Ro, Rs, and R,? (b) What is the potential 


difference between A and B? 
\ 


Rag = 42 


1 EW 
Z x: ¥ 
A, | | 


FIGURE 15-24 | 
Problem 15.12. V2 


15.13 The voltage drop across Rg; in the circuit dia- 
gram of Fig. 15-25 is 4 V. (a) Find the currents through 
the resistor R,, R2, and R3. (b) What is the resistance 
of R,? 


(Answer: (a) 0.5 A, 0.3 A, 0.8 A, (b) 1.67 .) 


V5 LSV 


FIGURE 15-25 
Problem 15.13. V, =6V 
15.14 Find the currents through the resistors Rj, 
Ry, and R3 of the circuit of Fig. 15-26. 


1 V 


“ V3 

RK 300 $2 _ 
[3 | | | | 

Vo 


b V 


FIGURE 15-26 
Problem 15.14. 


15.15 The circuit of Fig. 15-27 is known as the 
Wheatstone Bridge. It is used to find the resistance 
of an unknown resistor R, in terms of three known 
resistors R,, Rx, and R,. The value of R, is adjusted 
until no current flows through the galvanometer G. 
(The arrow over the resistor symbol of R, indicates 
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that R, is a variable resistor.) Let R; = 10 Q and 
R, = 100 ©. If no current flows through G when 
R, = 470 ©, what is the value of R,? 

(Answer: 4700 22.) 


FIGURE 15-27 hi | | sed 


Problem 15.15. 


15.16 A galvanometer has an internal resistance of 
2000 © and a current of 50 wA will cause full-scale 
deflection. What shunt resistance is required to use 
it as an ammeter. whose full scale reads 0.1 A? 
(Answer: 1 ©.) 


15.17 The galvanometer of problem 15.16 is to be 

used as a voltmeter with a maximum scale reading 

of 10 V. What series resistance is required? 
(Answer: 1.98 x 10° ©.) 


15.18 An electric light bulb marked 100 W is used 
in a home in which the wall outlet is at 120 V. What 
is the resistance of the filament in the bulb? 


15.19 An immersion heater draws 3 A when it is 
plugged in a 120-V wall outlet. What is the power 
consumption of the heater? 


15.20 If the immersion heater of problem 15.19 is 
used to boil a cup of water (m = 150 g) initially at 
27°C and 80% of the power is absorbed by the water, 
how long will it take for the water to boil? 


15.21 Anelectric heater of resistance 5 (2 is plugged 
in a 120-V outlet by means of an extension line. 
Compare the power loss in an extension line 5 m 
long when the line is made of No. 12-gauge copper 
wire (2.5 mm in diameter) and when it is made of 
No. 14-gauge wire (1.6 mm in diameter). The resis- 
tivity of copper is 1.7 x 107° Q-m. 

(Answer: 9.90 W, 23.96 W.) 


15.22 A megawatt (10° watts) of electrical power is 
needed to run a factory. Compare the energy losses 
in the transmission lines when the voltage is 120 V 
with when it is 6000 V. 


15.23 A resistor R = 1000 0 and a capacitor C = 
100 uF are connected in series with a 10-V battery 
and a switch. (a) How long after closing the switch 
will the voltage across the capacitor be 1.0 V. (b) 
When will the capacitor be charged to 99% of its 
final charge? 

(Answer: (a) 1.05 x 10~? sec, (b) 0.46 sec.) 


15.24 A capacitor C in series with a resistor R is 
charged by turning the switch to position a in Fig. 
15-28. After the capacitor has been charged, the switch 
is returned to position b. Find an expression for the 
charge on the capacitor as a function of time. Take 

= 0 at the moment the switch is changed from a 
to b. 


D 
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FIGURE 15-28 
Problem 15.24. 
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16.17 INTRODUCTION 


Almost everyone has performed elementary experiments with bar magnets. 
If the bar magnet is suspended by a thread or supported by a pivot, one of 
the ends will point in a northerly direction. This end of the magnet is called 
the north pole of the magnet, with symbol N. The opposite end of the magnet 
is called the south pole, with symbol S. Elementary experiments also show 
that like poles repel and unlike poles attract. This suggests that there is 
something that we call a magnetic field by which poles can exert forces on each 
other. This field is similar to the two other fields we have already considered, 
the gravitational field and the electric field. There is one important difference, 
however: If we break a bar magnet in half, we cannot make single poles, but 
instead we will have two bar magnets. The broken end becomes the south 
pole of the half that has the north pole, and the other broken end becomes 
the north pole of the half that has the south pole. 

There is an intimate relation between the motion of electric charges and 
magnetic fields, and our technological society is largely based on this rela- 
tionship, from the generation of electric power to many types of electronic 
devices. We will not deal with all these; instead we will consider only those 
effects which we need for the understanding of the concepts of modern 
physics presented in later chapters, namely, the magnetic field of a wire coil, 
the magnetic moment of a current loop, the force of a magnetic field on a 
moving charge, and the nature of electromagnetic waves. 


16.2 MAGNETIC FIELDS 


We may map magnetic fields by using a small compass that we will represent 
by a small arrow with its head as the north pole of the compass magnet. We 
arbitrarily define the direction of the magnetic field at a given point to be the 
direction in which the compass points. Figure 16-1 shows the fields of a bar 
magnet and a horseshoe magnet. The fields are indicated by continuous lines 
from the north to the south pole, and the number of lines is arbitrary, although 
in the comparison of two magnets the stronger one is customarily represented 
by a greater number of field lines. The direction of the magnetic field at a 
given point is the tangent to the field line at that point. In future drawings 
we may represent the field direction by a single arrow on a line. 

In 1820 the Danish physicist Hans Oersted (1777-1851) found that there 
is a magnetic field associated with current flowing in a wire. The direction of 
the magnetic field for a long, straight wire is schematically represented in 
Fig. 16-2. The field lines are circular about the wire. There are many concentric 
field lines, but the field becomes weaker as we move away from the wire. 
The direction of the magnetic field is determined by a right-hand rule. If the 
thumb of the right hand_is pointed in the direction of the current and the 
fingers are curled, the circular direction of the fingers is the direction of the 
magnetic field. 


= 
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FIGURE 16-1 
Magnetic field lines in between 


the poles of a bar magnet and a 
horseshoe magnet. 


Hans Christian Oersted (1777- 
1851). 
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Suppose we have a circular loop of wire that carries current, as in Fig. 
16-3. If we apply the right-hand rule, we see that one side of the loop has a 
north pole perpendicular to the plane of the loop and its opposite side will 
become a south pole. The arrows represent the magnetic field direction inside 
the loop. These field lines return outside the loop so that the loop itself becomes 
a magnet. This situation will remain regardless of the shape of the current 
loop; that is, a rectangular loop will give the same result. 


16.3 FORCE ON CURRENT-CARRYING WIRES 


In the preceding sections, we have presented some qualitative facts about 
magnetic fields. We have seen that magnets exert forces on other magnets. 
We have also seen that a wire carrying a current produces a magnetic field, 
that is, becomes a magnet. 

Experiment shows that when a wire carrying a current is placed in a 
magnetic field, it will experience a force. We can use this to define the mag- 
nitude of a magnetic field, B. (Remember that the direction of B has been 
defined as the direction taken by the north pole of a compass). 

Figure 16-4 is a schematic drawing of an experiment which shows that 
when a wire carrying a current is placed in a magnetic field B the force F is 
in a direction that is perpendicular to the plane defined by the field and the 
direction of the current. We also find experimentally that there is no force on 
a wire if the wire is in the direction of the magnetic field and that the force 
F is proportional to the sine of the angle @ between the field and the wire. 
The experiment also shows that the force on the wire is proportional to the 
current in the wire 7 and to the length of wire Al in the field. From these 
experimental results we can define the magnitude of the magnetic field B as 
follows: 

F 


° = FAlsin 6 ann 


F i 


FIGURE 16-2 

Magnetic field lines around a 
long, straight wire carrying a 
current 1. 


FIGURE 16-3 

Magnetic field lines on a plane 
through a current-carrying cir- 
cular wire loop. 


FIGURE 16-4 
Force on a current-carrying wire 


of length Al in a magnetic field. 
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It should be clear that Eq. 16.1 defines B unambiguously because, if we double 
or cut in half the current, the force on the wire will change accordingly. Similar 
arguments apply to Al and sin 8. 

From Eq. 16.1, the SI unit for B is newtons/ampere-meter (N/A-m). The 
name for this unit of B is the tesla (T). An older unit for the magnetic field 
still in use is the gauss (G), where 1 tesla = 10* gauss. The earth’s magnetic 
field is about 10+ T, so that 1 tesla is a large quantity. Having defined the 
magnetic field by Eq. 16.1, we can now state that when a segment of wire 
Al, carrying a current i, is placed in a magnetic field of magnitude B, it will 
experience a force 


F =iAlBsiné (16.2) 


The direction of this force is the perpendicular both to the magnetic field and 
to the direction of the current. We recognize that this relationship can be 
represented by a vector cross product. That is, Eq. 16.2 may be written as 


F=iAlxB (16.3) 


It is conventional to let the current i be a scalar quantity and to let Al be a 
vector pointing in the direction of the current. If we let the element of wire 
in Fig. 16-4 measure 0 at the reader’s end and let Al be the vector length in 
the magnetic field, then the right-hand rule for vector cross product discussed 
in Chapter 2 will yield a force in the upward direction perpendicular to the 
plane of vectors Al and B. 


16.4 TORQUE ON A CURRENT LOOP 


We are now able to understand the operation of the galvanometer that was 
used in Chapter 15 to construct ammeters and voltmeters. 

Consider a single rectangular loop of wire connected to a pivot rod, as 
shown in Fig. 16-5a. Let the length of sides 1 and 3 be a and that of sides 2 
and 4, b. We can use Eq. 16.3 to find the force on each side of the loop. For 
sides 1 and 3 the magnitudes of the forces are the same because the angle 
between Al and B is 90° and both wires have the same length a, that is, 


F, = F3; = 1 Al B sin 90° = iaB (16.4) 


From the definition of the cross product, we see that in Fig. 16-5a F, is out 
of the page toward the reader whereas F; is into the page. These two forces 
are drawn in Fig. 16-5b. Similarly the magnitudes of the forces on sides 2 and 
4 are equal. 


F, = Fy = 1 Al B sin (90° — 0) = 1ibB sin (90° — 86) 
The direction of F, in Fig. 16-5a is upward, while that of F, is downward. 


We conclude that there is no net force in any direction. However, if we 
look at the top view (Fig. 16-5b) along the pivot rod we see that there exists 


F=1Al xX B 
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a torque that tends to rotate the loop about the pivot rod. The torque is given 
by Eq. 8.5. 

 T=rxF (8.5) 
Applied to the present situation, we see that F, and F; exert a torque on the 
loop. The net torque is the sum of the individual torques caused by F, and 


F;, but because they are equal, we simply multiply the torque exerted by one 
of these forces by two, or 


7 = 21 xX F (16.5) 
7 = 2rF sin @ , (16.6) 


where F stands for either E, or F;. Substituting Eq. 16.4 for F in Eq. 16.6, we 
obtain | : 


7 = 2r iaB sin 0 (16.7) 
From Figs. 16-5a and 16-5b we see that r = b/2; therefore 
7 = iabB sin 0 (16.8) 


We recognize the product ab as the area of the loop. Calling this area A, Eq. 
16.8 may be written as 


7 = iAB sin 6 (16.9) 


Although Eq. 16.9 has been derived for a rectangular loop of wire, it can be 
shown that the result is the same for any other geometric configuration. 


FIGURE 16-5 

Torque on a current-carrying rec- 
tangular loop of wire on a pivot 
rod when placed in a magnetic 
field. (a) Side view. (b) Top view. 
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Equation 16.9 tells us that the torque is a maximum when @ = 90°, that is, 


when the plane of the loop lies in the direction of B, and it is zero when 
6 = 0°, that is, when Bis perpendicular to the plane of the loop. The magnitude 
of the torque can be increased by increasing the current in the loop. Because 
in the galvanometer this torque on the coil is opposed by the twisting (torsion) 
torque of the metal strip used to suspend the coil, the magnitude of the angle 
of rotation is a function of the current passing through the coil. An alternative 
way of increasing the torque on the coil is by using a coil made of several 
loops. To increase the sensitivity of the galvanometer at low currents, a coil 
of many loops of wire is used. 


16.5 MAGNETIC DIPOLE MOMENT 


In the preceding section, we saw that the important element in determining 
the torque on a wire loop, for a given field B in which it is placed, is the 
product of the area of the loop and the current through the loop (see Eq. 
16.9). This quantity is called the magnetic dipole moment or simply the magnetic 
moment of the coil with symbol wu, where 


= iA (16.10) 
The expression for the torque can now be written as 
T = pw B sin 0 (16.11) 


Equations 16.11 and 16.9 give the magnitude of the torque, but they do not 
specify the direction of 7. The direction of the torque can be obtained by using 
Eq. 16.5 


7 = 21x F (16.5) 


From the definition of the cross product, the direction of 7 in Fig. 16-5b is the 
perpendicular to the paper directed inward. We can specify both the mag- 
nitude and the direction of 7 in terms of the magnetic moment by assigning 
a vector direction to w. We define yw as a vector whose magnitude is given 
by Eq. 16.10 and whose direction is the perpendicular to the plane of the loop 
according to the right-hand rule. That is, we curl the fingers of the right hand 
in the direction of the current and the extended thumb indicates the direction 
of w (see Fig. 16-5b). We can now express the torque on the loop as 


7=pxB (16.12) 


From the definition of the cross product, it is clear that Eq. 16.12 yields the 
correct value for the magnitude (Eq. 16.11) and for the direction (Eq. 16.5) 
of 7. 

Because a magnetic dipole experiences a torque when placed in an ex- 
ternal magnetic field, work must be done by an external agent to change its 
orientation. As in the cases considered earlier (gravitational and electrical), 


uw = iA 


r= pxbB 


this work, by definition, becomes the potential energy E, of the dipole. Re- 
calling that only changes in potential energy are experimentally observed, we 
must define a zero or reference orientation. It is customary to set E, = 0 when 
6 = 90°, that is, when the dipole vector is perpendicular to the magnetic field. 
To calculate E, for any other orientation of «, we calculate the work using 
Eq. 8.13 


7] 
W, = [+46 (8.13) 


0 


Setting this work equal to E,, and substituting Eq. 16.11 for 7 in Eq. 8.13, 


] 
E, = I , MB sin 0 dé 
90 
] 
E, = —-B cos 6 
90° 
E, = —p@B cos 6 (16.13) 


This expression for E, can be written as a dot product (see Eq. 2.1), that is, 
E, = —p'B (16.14) 


We should notice that, because the cos 8 varies between 1 and —1, the maxi- 
mum energy is wB. This occurs when cos 0 = —1 or 0 = 180°, that is, when 
pm and B are antialigned. When pw and B are aligned, 0 = 0°, cos 6 = 1, 
and the potential energy is at its minimum value of E, = —wB. 


Example 16-1 


Assume that the electron in a hydrogen atom is essentially in a circular orbit 
of radius 0.5 x 107!° m, and rotates about the nucleus at the rate of 10% 
times per second. What is the magnetic moment of the hydrogen atom due 
to the orbital motion of the electron? 


Solution 


jf’ = area X current 


= mri 
where i is the current due to a single electron. Because current is defined as 
the amount of charge passing per unit time, we may view the electron’s orbit 
as a racetrack and ask how many times the electron passes a given point per 
second. The current is simply 
i = ev 


where v is the frequency of rotation—that is, the number of times the electron 
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passes a given point in its orbit per second—and e in the magnitude of the 
charge of the electron. 


fe = mr ev 
= 7 (0.5 X 107 m2 (1.6 x 107! C10" Hz) 
= 1.26 x 10°* A-m? 


Therefore, the hydrogen atom is essentially a small bar magnet and will 
behave as such in a magnetic field. 


16.6 FORCE ON A MOVING CHARGE 


We may use the development of Section 16.4 concerning the force ona current- 
carrying wire in a magnetic field to find the force experienced by a single 
charge. We saw in Eqs. 15.4 and 15.5 that we may write the current of either 
a positive or negative charge carrier as 


1=qNAv 


where q was the magnitude of a charge, N the number of charge carriers per 
unit volume, A the cross-sectional area, and v the average drift velocity of 
the charge carriers. If we substitute this into Eq. 16.3 we obtain 

F=i1AlxB (16.3) 
gNAvAl xB 


We see that A Al is the volume of the wire segment that is in the magnetic 
field B. The product of the charge density N (number of charges per unit 
volume) and the volume of the wire segment gives the total number of charges. 
Therefore, if NA Al is the number of charges experiencing a total force F, the 
force per charge carrier F, is F/NA Al and from Eq. 16.3 

rE, = NAA = qv xX (16.15) 
Note that this has been derived for a positive charge, because we implicitly 
assume that v is in the same direction as Al and therefore the current. If the 
charge carrier is the electron, the direction of the force is reversed. The force 
is perpendicular to the plane of v and B by definition of the vector cross 
product. It is therefore perpendicular to the direction of motion given by the 
vector v. The definition of work is the dot product of force and displacement 
dW = F-ds = F ds cos 9. Because the infinitesimal displacement ds has the 
same direction as the instantaneous velocity v, and because F is perpendicular 
to v, 8 = 90°, cos @ = 0, and the magnetic field does no work on the charge. 
The magnetic field therefore does not change the magnitude of the velocity 
of the charged particle. 


qv x B 


16.7 THE HALL EFFECT 


Suppose we have a conducting metal strip of width d and thickness ¢ con- 
nected in a circuit and placed in a uniform magnetic field B as in Fig. 16-6. 
Let the direction of the magnetic field be into the paper, indicated by the 
symbol &), which suggests the tail of an arrow. The electric field €, responsible 
for the current i will be directed to the right. If we assume for the moment 
that the current is caused by positive charges, their drift velocity v will be in 
the direction of €, as shown in Fig. 16-6. 

Let us consider two points D and C on the metal strip such that the line 
joining the two points is perpendicular to €,, (see Fig. 16-6). Without the 
magnetic field, the potential difference between these two points is zero 
because no work is done in moving a charge from one point to the other. 
When the magnetic field is turned on, the drifting charges will experience a 
force given by Eq. 16.15. We label this force Fz to indicate that this is the force 
caused by the magnetic field B. 


Fz =qv xX B=gvB (16.15) 


This force, illustrated in Fig. 16-7, causes the positive charges to move to the 
upper part of the conducting strip while they are moving to the right. Because 
the sample as a whole must remain neutral, the lower part of the strip will 
become negatively charged. This situation is also shown in Fig. 16-7. The 
accumulation of positive charges along the upper part and of negative charges 
along the lower part creates an electric field €, that opposes the further 
upward drift of positive charges. There will be a potential difference V;, 
between D and C associated with this electric field. From Eq. 14.19 


Va = Vo —- Ve = bd (16.16) 


where it is assumed that in equilibrium €, is constant and d is the width of 
the strip (the distance between D and C). This voltage difference is called the 
Hall voltage after the physicist who first measured it, and the phenomenon is 
called the Hall effect. It is clear that the equilibrium Hall voltage V;, will be 
established when the downward force of €, equals the upward force resulting 
from the magnetic field. Because the force of the electric field is given by 
definition in Eq. 14.1 as Fe = gé,, we can say that at equilibrium (which is 
quickly established) 


Fe = Fz 
qéy = qvB 
therefore 
éy = vB 


Substituting for €, in Eq. 16.16, we obtain 
Vo = UBd (16.17) 
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FIGURE 16-6 

Experimental arrangement for the 
measurement of the Hall volt- 
age. 


FIGURE 16-7 
Behavior of mobile positive 


charges in the arrangement of Fig. 
16-6. 
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Because the Hall voltage can be readily measured by connecting a volt- 
meter between D and C, the Hall effect permits the experimental determi- 
nation of the drift velocity of the charge carriers. We can obtain additional 
information if we use Eqs. 15.4 or 15.5 as follows 


1 = qNAv 
or 
1 
v= aNA (16.18) 
Substituting Eq. 16.18 for v in Eq. 16.17, we obtain 
| iBd 
= ar 16. 
Vu GNA (16.19) 


Note that A is the cross-sectional area of the foil, hence A = thickness (ft) x 
width (d). Therefore 


Vi = a (16.20 
= GN t 20) 
OT 
B 
Vu = Rua (16.21) 


where Ry = 1/qGN is called the Hall coefficient. Because i, B, and t are mea- 
surable, the magnitude of the Hall voltage will yield the value of N, the density 
of charge carriers. In the SI system of units, this density will be the number 
per cubic meter. 

Additional important information can be obtained from the Hall effect. 
In our discussion, we assumed that the charge carriers were positively charged. 
These charges were deflected toward the upper part of the foil, raising the 
potential of point D with respect to point C. Suppose, however, that the 
charge carriers are negatively charged particles. In this case, the drift velocity 
of the carriers will be opposite to that of €@,, as in Fig. 16-8. Although the 
velocity vector v is reversed, the direction of the force given by Eq. 16.15 is 
still upward because the charge q is negative. As a result, the upper part of 
the strip will have an accumulation of negative charges and the lower part 
an accumulation of positive charges. Point D will now be at a lower potential 
than point C. The polarity of the Hall voltage will tell which type of carrier 
is responsible for conduction. We will see in Chapter 25 that the semicon- 
ductors used in logic circuits can be made to have either positive or negative 
charge carriers. 


Example 16-2 


A current of 50 A is established in a slab of copper 0.5 cm thick and 2 cm 
wide. The slab is placed in a magnetic field B of 1.5 T. The magnetic field is 
perpendicular to the plane of the slab and to the current. The free electron 
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FIGURE 16-8 

Behavior of mobile negative 
charges in the Hall effect experi- 
ment. | 
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concentration in copper is 8.4 x 1078 electrons/m>. What will be the magnitude 
of the Hall voltage across the width of the slab? 


Solution Using Equation 16.19 
171Bd 
VS. 
0 Nq A 
50A xX 1.5T X 2 x 10°*m 
8.4 x 10° m3 x 1.6 x 10°C x 1074 m? 


1.12 x 10°° V 


16.8 ELECTROMAGNETIC WAVES: THE NATURE OF LIGHT 


In 1670, Christian Huygen had proposed that the propagation of light could 
be explained by assuming that light is a wave. Huygen’s theory was not 
widely accepted until 1801, when Thomas Young performed the first suc- 
cessful experiment that exhibited the interference of light. Even though Young's 
experiment established firmly the wave nature of light, one important ques- 
tion remained unanswered: What is the nature of the light wave? 

Starting with the fundamental laws of electromagnetism, James Maxwell 
(1831-1879) in 1873 showed that accelerated charges would produce electro- 
magnetic waves whose velocity of propagation c through free space should 
be 


c = 3 X 108 m/sec 


Maxwell and other physicists of that period also showed that an electromag- 
netic wave consists of an electric field 6 and a magnetic field B perpendicular 
to each other with both € and B perpendicular to the direction of their prop- 
agation. The spacial and temporal behavior of the electric and magnetic fields 
is identical to the traverse motion of the particles in a string when a traveling 
wave propagates through it. 

Suppose an electromagnetic wave travels along the x axis. If we measure 
the value of € and B at different points along the x axis, at some fixed time 
t we will observe that both 6 and B vary sinusoidally with x. This behavior 
is illustrated in Fig. 16-9. Similarly, if we sit at a fixed point in space and 
measure € and B at that point as a function of time, we observe that both 
vary sinusoidally with time. 

This behavior of the electric and magnetic fields can be represented math- 
ematically as 


é 


and (16.22) 
B = Bosin (kx — ot) 


Eo sin (kx ae wt) 


James Clerk Maxwell (1831-1879). 
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Equations 16.22 have the same mathematical form as the sinusoidal traveling 
wave that was introduced in Chapter 11 (see Eq. 11.4 and following). James 
Maxwell showed that any charge distribution that oscillates sinusoidally with 
time should produce electric and magnetic fields that behave as described by 
Eqs. 16.22. Moreover, the frequency w of the electromagnetic wave should 
be the same as the frequency of oscillation of the charges producing it. We 
should indicate at this point that no motion of material particles is involved 
in the electromagnetic wave, hence, there is no need for a medium of propa- 
gation. 

One of the key characteristics of a particular type of wave is its velocity 
of propagation. Maxwell’s theoretical prediction was that all electromagnetic 
waves should travel with velocity c = 3 x 10° m/sec. Within the experimental 
uncertainty this was the value that was measured for the speed of light in 
1849. This fact led Maxwell to postulate that light is an electromagnetic wave. 
Fifteen years after Maxwell’s calculations, Heinrich Hertz (1857-1894) was 
able to produce waves of electromagnetic origin using a circuit with an os- 
cillating current flowing through it. Hertz found that the speed of his elec- 
tromagnetic waves agreed, within the experimental uncertainties, with the 
value predicted by Maxwell. The common electromagnetic spectrum is listed 
in the following table, although there are no upper or lower limits. 


Electromagnetic Spectrum 


Name Frequency (Hz) Wavelength (m) 
Gamma rays 1073—10!° 107 4-107 '° 
X rays 107°-10!° 107 '*-1078 
Ultraviolet rays 10'7~10!° 10~°-10-° 
Visible light (4-7.5) x 10% (7.5-4) x 1077 
Infrared rays 10'4-10!! 10~-°-10~4 
Microwaves 10'2-10° 10°-4-107! 
Short radio waves 10°—10° 10~3-10~? 
FM, TV 10° | 
AM radio 10’-10° 107-10 
Long radio waves 10°—107! 10°—10’ 


FIGURE 16-9 

In an electromagnetic wave, the 
electric and the magnetic fields 
are at right angles to each other 
and to the direction of propa- 
gation of the wave. 


Heinrich Hertz (1857-1894). 
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All the laws of electromagnetic waves apply to waves of the entire electro- 
magnetic spectrum. Some wavelengths, such as visible light, are more ac- 
cessible for experiment than are other wavelengths, but all have been at least 


partially checked for consistency with this model. 


PROBLEMS 


16.1 What force is experienced by a wire of length 
1 = 0.08 m at an angle of 20° to the magnetic field 
direction carrying a current of 2 A in a magnetic field 
of 1.4 T? 


16.2 The earth’s magnetic field at the equator is 
4 x 10-° T and is parallel to the surface of the earth 
in the south-north direction. (Note that the earth’s 
geographic north pole is the magnetic south pole.) 
A wire 2 m long of mass m = 9 g is suspended by 
a string. The wire is also parallel to the earth’s sur- 
face and carries a current of 150 A in the east-west 
direction. (a) What is the tension on the string? 
(b) What would be the tension if the current was in 
the west-east direction? 

(Answer: (a) 10.02 x 10~?N, (b) 7.62 x 1072N.) 


16.3 A rectangular wire loop carrying a current 
i = 5 A is hung by a string from the end of a rod 
pivoted about its midpoint, as in Fig. 16-10. The 
lower part of the loop is in a region where there is 
a uniform magnetic field B = 2 T perpendicular to 
the plane of the loop as shown. What weight must 
be placed on the other end of the pivoted rod to 
balance it? 

(Answer: 8 N.) 


FIGURE 16-10 
Problem 16.3. 


16.4 The wire of Fig. 16-11 carries a current 1 = 
2 A. Find the force on each segment of the wire when 
it is placed in a region where there is a magnetic 
field B = 1.5 T directed along the positive y axis. 


J t" 
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FIGURE 16-11 
Problem 16.4. 


16.5 The wire loop of Fig. 16-12 carries a current 
of 2 A. It is placed in a region where there is a 
magnetic field B = 0.5 T parallel to the plane of the 
loop. (a) Calculate the force on each side of the wire 
loop. (b) What is the torque on the wire loop? 


FIGURE 16-12 
Problem 16.5 


16.6 The wire loop of Fig. 16-13 carries a current 
i = 10 A. It is placed in a uniform magnetic field 
B = 1.2 T. Let r = 2 m. (a) Find the net force on 
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the circular part of the loop. (b) What is the net force 
on the loop? 
(Answer: (a) 48 N, in the upward direction, (b) 0.) 


FIGURE 16-13 
Problem 16.6. 


16.7 What is the maximum torque that acts on a 
coil of wire that consists of 10 loops of diameter 0.04 
m and carries a current of 2 x 10~° A ina magnetic 
field of 3 x 10°? T? 


16.8 A wire of length / is to be used to make a coil 
of one or several circular loops through which a cur- 
rent 7 will be passed. (a) Show that the maximum 
magnetic dipole is obtained by making a single loop. 
(b) What is the dipole moment? 


16.9 How much work must be done to rotate the 
loop of Problem 16.6 from the position shown in Fig. 
16-13 to a position where the magnetic field is par- 
allel to the plane of the loop? 


16.10 Amagnetic dipole » with amoment of inertia 
I is placed in a uniform magnetic field B. Initially uw 
is in the equilibrium position, that is, parallel to B. 
The dipole is then rotated by a small angle @ and 
then released. (a) Show that the subsequent motion 
of the dipole is approximately simple harmonic. (b) 
What is the period of the motion? (Hint: For small 
angles 0, sin 0 ~ @.) 


i6.11 A ring made of an insulator with radius 
r = 0.2 m has a uniformly distributed charge q = 
4 x 10°*C. The ring is placed in the x-y plane of a 
region where there is a uniform magnetic field B = 
1.2 T directed along the positive y axis (see Fig. 16- 
14). The ring is rotated about the z axis with constant 
angular velocity w = 20 rev/sec. (a) What is the mag- 
netic moment of the rotating ring? (b) What is the 
torque exerted by the magnetic field on the ring? 
(Answer: (a) 1.01 x 107° A-m?, 
(b) 1.21 x 10-° N-m.) 


FIGURE 16-14 


16.12 A proton is moving with a velocity v = 
(3 x 10° i + 7 X 10° k) m/sec in a region where 
there is a magnetic field B = 0.4 j T. What is the 
force experienced by the proton? 

(Answer: (1.92 k — 4.48 j) x 107*N.) 


16.13 A proton is accelerated through a potential 
difference of 200 V. It then enters a region where 
there is a magnetic field B = 0.5 T. The magnetic 
field is perpendicular to the direction of motion of 
the proton. What is the force experienced by the 
proton? 


16.14 Acharged particle g is projected in the region 
between two parallel plates. In the region of the 
plates there is an electric field € = 50,000 N/C and 
a magnetic field B = 0.1 T. The electric field is per- 
pendicular to the magnetic field, and both are per- 
pendicular to the direction of motion, as shown in 
Fig. 16-15. If the particle goes through the plates 
undeflected, what is the velocity of the particle? 
(Answer: 5 X 10° m/sec.) 


@——_> | @B 


FIGURE 16-15 
Problem 16.14. 


16.15 A proton is accelerated through a potential 
difference of 300 V. It then enters a region where 
there is a magnetic field B = 0.8 T and an electric 
field 6. The electric field is perpendicular to the mag- 
netic field, and both are perpendicular to the direc- 
tion of motion of the proton (see Fig. 16-16). The 


proton moves through undeflected. What is the value 
of @? 
(Answer: 1.92 x 10° N/C.) 
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FIGURE 16-16 i 
Problem 16.15. x 2 ” x 


16.16 <A particle of mass m = 15 g and charge q = 
3 x 10°~° Cis moving horizontally near the surface 
of the earth with a velocity v = 50 m/sec. What is 
the magnitude and direction of the smallest mag- 
netic field B that will keep the particle moving in a 
straight line? Ignore the electric field mentioned in 
Problem 15.6. 


16.17 An electron is moving with a velocity v; = 
(2 x 10°i + 4 x 10° j) m/sec in a region where there 
is a uniform magnetic field, it experiences a force F, 
along the z axis. A second electron with velocity 
v2 = 3 X 10° k m/sec experiences a force F, = 7 X 
10~*° iN. (a) What is the direction and the magni- 
tude of the magnetic field? (b) What is the magnitude 
and direction of F,? 

(Answer: (a) 1.46 j T, (b) 4.67 x 10°° KN.) 


16.18 A strip of copper 1 cm wide and 1 mm thick 
has 50 A of current passing through it. The strip is 
in a magnetic field of 0.5 T directed into the paper 
(see Fig. 16-17). The voltage difference Vy = Vc — Vp 
= 2 x 107° V and the observation that Vc is larger 
that Vp indicates that the conduction is by electrons. 
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What is the density of the electrons responsible for 
the current? (e = 1.6 x 10°" C.) 


FIGURE 16-17 
Problem 16.18. 
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16.19 A rectangular slab of silicon of thickness 1 
mm is used to measure an unknown magnetic field 
B. The free electron concentration of that particular 
type of silicon is 6 x 10** electrons per m°®. When 
the slab is placed in the region of the magnetic field, 
perpendicular to the field, and the current in the 
slab is 20 mA, the Hall voltage is 150 wV. What is 
the strength of the magnetic field? 

(Answer: 7.2 T.) 


16.20 A long metal plate of width d = 1 cm is 
moved with constant velocity v in a region where 
there is a magnetic field B = 0.9 T (see Fig. 16-18). 
A potential difference V = 4.5 x 10° V appears 
across two points D and C. The line joining the two 
points is perpendicular to the direction of motion of 
the plate. What is the velocity v? 

(Answer: 0.5 m/s.) 
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17.1 INTRODUCTION 


By the end of the nineteenth century, most physicists felt rather good about 
the state of their art. In fact, some felt that their successors would spend their 
time simply taking measurements to the next decimal place. There were rea- 
sons for this complacent attitude. Most of the astronomical data about the 
motion of the planets, as well as the behavior of ordinary mechanical systems, 
could be explained using Newton’s laws of motion and his law of universal 
gravitation. The empirical laws concerning electric and magnetic fields had 
been discovered and fused together by Maxwell, and his prediction of the 
existence of electromagnetic waves had been experimentally verified by Hein- 
rich Hertz: The nature of light was no longer a mystery. More important, the 
same laws used to explain the behavior of macroscopic systems were also 
able to explain the behavior of submicroscopic objects (atoms and molecules). 
This came about with the development of the techniques of statistical me- 
chanics. By applying Newton’s laws statistically the ideal gas law, PV = nRT 
could be derived. Similarly, the specific heat of gases could be predicted in 
agreement with the available experimental data. 

There were a few minor problems. We will mention two of them that 
were instrumental in the advent of the scientific revolution that today we call 
modern physics. The principle of relativity seemed to fail when applied to 
electromagnetism. The principle states that the laws of physics should be the 
same in all inertial frames of reference. Someone performing experiments in 
a spaceship moving with constant velocity with respect to the earth obtains 
the same results from the experiments as does an experimenter on earth. 
Because physical laws reflect the results of the experiments, it follows that 
these laws have to be the same (must have the same mathematical form) in 
all inertial frames of reference. This mathematical invariance was shown to 
be preserved with the laws of mechanics, but it broke down with the laws 
of electricity and magnetism. This ‘minor’ problem eventually led to devel- 
opment of Einstein’s special theory of relativity. 

Another problem that baffled physicists at the beginning of the twentieth 
century was the nature of the spectrum emitted by a class of objects called 
blackbodies. The predictions of classical ideas did not fit the experimental re- 
sults. This problem led to the development of what we now call quantum 
mechanics. Relativistic effects do not generally affect computer operation, so 
we will address only the quantum mechanical part of modern physics in the 
remainder of this book. 


17.2 BLACKBODY RADIATION 


All substances at finite temperatures radiate electromagnetic waves. Isolated 
atoms (in a gas) emit discrete frequencies, molecules emit bands of frequen- 
cies, and solids radiate a continuous spectrum of frequencies. 


The details of the spectrum emitted by a solid depend on its temperature 
and to some extent on its composition. At room temperature the spectrum 
is centered around the infrared; that is, most of the radiation emitted lies in 
the infrared part of the electromagnetic spectrum. As the temperature of the 
solid increases, more and more of the emitted radiation is in the visible part 
of the spectrum; we see it first glow red and then approach white as the 
temperature is increased. 

Objects that emit a spectrum of universal character, one that does not 
depend on the composition of the object, are called blackbodies. The reason 
for the name is that these objects absorb all the radiation incident on them. 
They do not reflect light, and hence they appear black. We see them by 
contrast with other objects or their background. Any object painted with a 
dull black pigment is a good approximation to an ideal blackbody. Another 
type of blackbody is a metallic cavity with a small hole (see Fig. 17-1). Any 
radiation entering the hole has a very small probability of being reflected out, 
hence the object (hole) is “black.” After multiple partial reflections by the 
inner walls of the cavity, the radiation is eventually absorbed by the atoms 
in the walls of the cavity. These atoms, in turn, will reradiate electromagnetic 
waves into the cavity and some of it will leak out through the hole. Theo- 
retically, the character of this radiation that leaks out is the same as that of 
the other type of blackbody. 


17.2a Character of the Spectrum of a Blackbody 


The main features of the spectrum emitted by a blackbody are: 


1. The spectrum is continuous with a broad maximum. This fact is shown in 
Fig. 17-2, which is a plot of I(v), the spectral radiancy at each frequency, 
versus the frequency of the radiation. The spectral radiancy is the energy 
per frequency emitted per unit time per unit area of the blackbody. The 
two curves correspond to two different temperatures of the object. 

2. The integral of [(v) over all v, which we call I;, represents the energy 
emitted per unit time per unit area, regardless of the frequency, and it is 
found to increase with the fourth power of the temperature. This empirical 
fact, known as the Stefan-Boltzmann law, states that, 


a in I) dv = oT! 


where the constant 0 = 5.67 X 107° W/m?-K*. This integral is clearly the 
area under the curve for each temperature. 

3. Figure 17-2 also shows that the spectrum shifts toward higher frequencies 
as the temperature increases. In fact, one finds experimentally that the 
frequency Vmax, at which I[(v) is a maximum, increases linearly with the 
temperature of the cavity (blackbody), that is, 


Vmax % T 
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FIGURE 17-1 

A metallic cavity with a small hole 
is an example of a blackbody. Ra- 
diation entering the hole is even- 
tually absorbed after successive 
reflections at the inner walls of 
the cavity. Some of the radiation 
reemitted by the atoms in the 
walls of the cavity leaks out 
through the hole. This radiation 
has the same character as that of 
any other blackbody. 


FIGURE 17-2 

The intensity of the radiation 
emitted by a blackbody as a func- 
tion of the frequency of the ra- 
diation for two different temper- 
atures of the blackbody. Note that 
the total energy (arex under the 
curve) and the frequency at which 
the intensity is a maximum in- 
crease with increasing tempera- 
ture. 
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17.2b Planck’s Theory 


Attempts by physicists to explain the blackbody spectrum using the laws of 
classical electromagnetism and thermodynamics proved unsuccessful. 

On December 14, 1900, at a meeting of the German Physical Society, Max 
Planck (1858-1947) presented a paper entitled, ‘‘On the Theory of the Energy 
Distribution Law of the Normal Spectrum.” This event is considered the 
birthday of quantum mechanics. As we will see, these ideas were at first 
introduced a little bit haphazardly, with no justification other than that they 
accounted for the experimental facts. Eventually, these ideas were fused to- 
gether into a set of fundamental principles by Erwin Schrodinger and Werner 
Heisenberg. 

Planck’s approach to the problem was to find an empirical mathematical 
expression for I(v) that would fit the experimental data. He then observed 
that he could derive the expression by making a revolutionary physical hy- 
pothesis, namely; a system undergoing simple harmonic motion with frequency v 
can only have and therefore can only emit energies given by E = nhv, where n = 1, 
2, 3, ...andh is a constant now known as Planck’s constant. The value of 
h, which resulted in a good fit between the data and the expression found 
by Planck for I(v), is 6.63 x 10~°4 Joule second (J-sec). 

We know that the energy of a harmonic oscillator is proportional to the 
square of the amplitude of the motion (Section 10.6). In a classical treatment, 
such as an oscillating spring, this amplitude may vary continuously from zero 
to infinity. In contrast, Planck postulated that atomic oscillators can have only 
discrete energy values. The classical and Planck’s energy spectra for an oscil- 
lator are contrasted in Fig. 17-3. By Planck’s hypothesis, because an oscillator 
(such as the atoms in the walls of the cavity) can take only certain values for 
the energy, when they lose that energy (by emitting electromagnetic waves), 
they lose it in multiples of hv. These small quantities of energy are called 
quanta (singular, quantum). 

Using the energy spectrum just described for the atomic oscillators, and 
consequently for the electromagnetic waves emitted by them, together with 
simple thermodynamic arguments, Planck derived a rather complicated 
expression for I(v) that matched the experimental data: 


Iv) = 27hv 1 
= c? exp (hv/kgT) — 1 


(17.1) 
where c is the velocity of light, kg is the Boltzmann constant, vis the frequency 
of the electromagnetic wave, and T is the absolute temperature of the black- 
body. 

Every day experience shows that an oscillator, for example, a pendulum 
or a mass connected to a spring, stops oscillating progressively and smoothly, 
not in jumps. Is Planck’s hypothesis in conflict with this macroscopic exper- 
imental observation? Not really. Let us consider a mass m = 10 kg, attached 
to a spring of force constant k = 10° N/m. Let the initial amplitude of the 


E = nhv 


E 


——— a 7 
(a) Amplitude 


—— —— F5=5hv 

——— Fa=4hv 
- ee Hg me Shy 
——————— Fn=2hp 


—E),=hv 


(b) —_____———F,=0 


FIGURE 17-3 
(2) Dependence of the energy of 


a classical oscillator on the am- 
plitude of the motion. Because 
the amplitude of the motion can 
be varied continuously from zero 
to infinity, the energy of an os- 
cillating body can have any value 
between zero and infinity. (0) 
Discrete (quantized) energy spec- 
trum of atomic oscillators as pro- 
posed by Planck to explain the 
spectrum of frequencies emitted 
by a blackbody. 
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motion be A = 0.1 m. From elementary mechanics, we know that 


1 i 
E = kA? = > x 10° N/m x (0.1m? = 5] 


and 


1 |[k 
a —_ 


= 1.59 H 
27 Vm . 


The separation between adjacent energy levels is, by Planck’s hypothesis 
AE = hv = 6.63 X 10~*4 J-sec X 1.59 sec™! =~ 107° J 


This example shows that, for a macroscopic oscillator, the separation between 
the allowed energy states (as postulated by Planck) is extremely small com- 
pared with the energy of the oscillator. The oscillator may be losing energy 
in jumps, but the effect is not noticeable. 


17.3 THE PHOTOELECTRIC EFFECT 


The quantum idea, introduced by Planck to explain the spectrum of a black- 
body, was further expanded by Albert Einstein (1879-1955) in connection 
with the photoelectric effect. Under certain conditions, which we will discuss 
shortly, light incident on a metal will cause electrons to be ejected from the 
surface of the metal. This is known as the photoelectric effect. 

We will summarize an experiment that can be used to study the properties 
of the electrons ejected from the metal. We will then see the failure of classical 
ideas to explain the results and, finally, we will introduce Einstein’s hypoth- 
esis about the nature of electromagnetic radiation and how this hypothesis 
accounts for the experimental facts. 


17.3a Experimental Facts 


An experimental arrangement that can be used to study some of the properties 
of the photoelectric effect is shown in Fig. 17-4. The apparatus consists of an 
evacuated tube with two metal plates, C, the cathode, and A, the anode. 
Monochromatic light (single wavelength) is sent through a quartz window 
onto the cathode C. Because the anode is at a negative potential V with respect 
to the cathode, the electrons, on being emitted by the incident light striking 
the cathode, face a retarding voltage V. To reach the anode the photoelectrons 
must be ejected with a kinetic energy E;, that is greater than the difference in 
potential energy, |e|V, between the anode and the cathode. When E; = |e|V, 
the electrons, on reaching the anode, will be able to contribute to the current 
through the circuit, which is measured by a galvanometer G. The tube is 
evacuated to minimize the collisions between the photoelectrons and the gas 
molecules in the tube. By varying the retarding voltage V, the spectrum of 


Max Planck (1858-1947). 
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FIGURE 17-4 
Experimental arrangement for the 
study of the photoelectric effect. 
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energies with which the electrons are emitted can be determined. Other pa- 
rameters can be varied to see how they affect the energy and the number of 
photoelectrons emitted; these include the intensity I and the frequency v of 
the incident light and the nature of the cathode. A summary of the experi- 
mental results is shown in Fig. 17-5. 

If the value of the retarding voltage V and of the frequency of the light 
are kept constant (Fig. 17-5a), it is seen that the photocurrent i through the 
circuit increases linearly with increasing intensity I of the incident radiation. 
This in turn means that the number of electrons emitted with energies E, = 
le|V increases with I, because i is proportional to the number of electrons that 
are collected by the anode. 

Figure 17-5b shows the dependence of i, and hence of the number of 
emitted electrons capable of reaching the anode, on the value of the retarding 
voltage. The experiment is performed while keeping both the intensity and 
the frequency of the radiation constant. The two curves correspond to two 
different values of I. The result can be easily understood. For small V’s, only 
the electrons emitted with small energies are turned back by the retarding 
voltage, and therefore do not contribute to the current i. As V is increased, 
electrons with higher energies will be turned back, and the current will de- 
crease. When V = Vo (Vo is called the stopping potential), all the electrons, 
including the most energetic ones, are turned back and the current drops to 
zero. Vo is therefore a measure of the maximum energy with which the elec- 
trons are ejected from the cathode, 


lel\Vo = Ex max (17.2) 


Figure 17-5) shows that Vo, and therefore the maximum energy of the photoelec- 
trons, 1s independent of the intensity of the light. 

The dependence of V, on the frequency v of the light can be examined 
by repeating the previous experiment with different v ’s. Figure 17-5c shows 
that Vo (hence EF; max) increases linearly with v. That is, 


Vo =— AV (17.3) 


The value of the slope a is found to be 4.1 xX 107? J-sec/C. The linear de- 
pendence of Vp) on v and the value of the slope remain unchanged if the 
experiment is repeated with a cathode made of a different metal. Figure 17- 
Sc also shows that for frequencies v = v, Vo is zero. This means that no 
voltage is necessary to stop the most energetic electrons; no voltage is needed 
because no electrons are emitted when v = »,. As the graph indicates, the 
value of v. depends on the material used for the cathode. 

There is one final experimental fact that is crucial to the discussion that 
will follow. When the conditions for photoemission are favorable (high enough 
v, low enough V), the emission is almost instantaneous. The photocurrent 
has been observed to occur within 10~? sec from the onset of illumination, 
which is the limit of experimental accuracy. This essentially instantaneous 
emission has been observed to take place with extremely low intensities of 
light, as low as 107 1° W/m’. 


(a) 


dd» constant 


(0) 


(c) Frequency p—— > 
FIGURE 17-5 

(a) The photocurrent (the num- 
ber of emitted electrons) in- 
creases with increasing light in- 
tensity. (b) The number of emitted 
electrons able to reach the anode 
A decreases as the retarding volt- 
age increases. The stopping po- 
tential V, is independent of the 
light intensity. (c) The stopping 
potential increases linearly with 
increasing frequency of the light. 
For frequencies below v,, V, is 
zero because no electrons are 
emitted. The value of v, depends 
on the material being illumi- 
nated (the cathode). The two 
curves are for cesium (Cs) and 
copper (Cu) cathodes respec- 
tively. 


17.3b Failure of Classical Physics to Explain the Results 


According to classical physics, light is an electromagnetic wave (see Section 
16.8). To understand the failure of classical physics to explain the experimental 
results just presented, we need to remind ourselves of two facts about waves: 


1. The energy of a wave is continuously distributed over the entire space 
traversed by the wave. For example, when the ripples in the pond move 
outward from their source, all the water in their path is displaced. 


2. The intensity of a wave, which represents the energy carried by the wave 
per unit area perpendicular to the direction of propagation of the wave, 
per unit time is proportional to the square of the amplitude of the wave 
(Eq. 11.20). In the case of electromagnetic waves it can be shown that 


1 
T= 5 ec € 


where € 9 is the permittivity of free space, c is the velocity of light, and G, 
is the amplitude of the electric field of the wave. 


With these two facts in mind, let us consider an electron that is bound 
with some energy E, to the metal. An electric field @ = Go sin (kx — ot) 
impinges on the bound electron. The electric field will exert a force F = |e/é 
= |e|€, sin (kx — wt) on the electrons. This force will do work on the electrons, 
the amount of work depending on the magnitude of the force. As a result, 
the electric field will increase the energy of the electrons and, if the energy 
that the electrons pick up from the electromagnetic field is greater than the 
binding energy that keeps them in the metal, the electrons will come out of 
the metal with a kinetic energy E;, which is the difference between the energy 
absorbed from the wave and the binding energy E,. As the amplitude of the 
wave increases, the magnitude of the force increases, and so does the work 
done by the electromagnetic field on a given electron. We therefore expect, 
from classical physics, that the energy given to the electron will increase as 
the intensity of the wave increases; detailed calculations show that the energy 
absorbed is, indeed, proportional to the intensity. 

Let us now reexamine the experimental results. The results of Fig. 17-5a 
can be explained in terms of classical concepts. The electrons in the metal are 
bound differently: some more tightly than others. Given a certain intensity 
of the wave and therefore a certain amount of energy available to them, the 
electrons will use it to liberate themselves from the metal; any remaining 
energy will be in the form of kinetic energy of the electrons. For small inten- 
sities only those electrons that are weakly bound will come out with sufficient 
kinetic energy to overcome the retarding potential and to contribute to the 
current. As the intensity is increased the energy available will increase and 
more electrons will come out with sufficient energy to reach the anode. The 
current should increase with increasing intensity, and it does. 

The fact that E;, max is independent of the intensity is difficult to explain 
by classical theory. If you increase I, you increase the energy available to all 
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the electrons, including those that are the least tightly bound and that there- 
fore come out with the maximum kinetic energy. Thus the fact that Vo is 
independent of I (see Fig. 17-5b) cannot be explained by classical ideas. 

The fact that E, max increases with v (see Fig. 17-5c) cannot be accounted 
for by classical physics. As we have seen, the energy of the electromagnetic 


wave depends on its intensity (amplitude squared), not on its frequency. Why 


should V) depend on v? Why is there a v, below which no electrons are 
emitted, no matter how intense the wave is? Classical physics provides no 
answer. 

Finally, the fact that the emission is almost instantaneous plays a key 
role in the rejection of the classical ideas about the nature of electromagnetic 
radiation. If we consider radiation with intensity I = 107 '° W/m’, there is no 
way that the electrons can be emitted in 10~° sec. It should take considerably 
longer. Let us consider a sheet of some metal with an area of 1 m*, as shown 
in Fig. 17-6, and let us assume that light of intensity J = 10~'° W/m? shines 
on it. As we mentioned, the energy of the beam is spread continuously over 
the entire wave front. Let us be optimistic and assume that all the energy 
falling on a certain atomic site of the metal sheet is absorbed by only one of 
the electrons of the atom, the most loosely bound. It is well known, from X- 
ray studies, that the interatomic separation d in a metal is about 2 AQA= 
10~!° m). We can use this fact to find out how many atomic sites there are 
in the first layer of the metal surface. 


The number of atomsin 1m _ 1m 7 ; 
a 1-m-long row =e oF ag im 5 X 10° atoms/row 


For simplicity, let the metal have cubic structure. This means that there are 
as many rows as we have atoms in one row. And, consequently, the number 
of atoms in the first layer of the metal sheet will be (5 x 10%)? = 2.5 x 10” 
atoms/layer. According to our generous assumption 10° '° J/sec-m* are shared 
by 2.5 x 10’? electrons. That is, 


10~ 1° J/sec 


55x 10% =A <%10°™ J/sec-e 


Energy/second-electron = 


It is known from other types of experiments that the minimum binding energy 
of an electron in a metal is typically a few eV’s. Let us take 1 eV. The time 
required for the electron to collect 1 eV from the electromagnetic wave will 
be, 


_ 1leV x 1.6 x 107" J/eV 


— 4 x 101° 
4 x 10~°° J/sec a 


f 


or ~ 10° days 


Thus, we see that classical physics also fails to explain the short time release 
of electrons in the photoelectric effect. 


—— 


FIGURE 17-6 


17.3c Einstein’s Theory 


In 1905, five years after Planck’s historic paper, Einstein was able to explain 
the photoelectric effect by proposing a theory about the nature of electro- 
magnetic radiation that was dramatically different from that of classical elec- 
tromagnetism. 

According to Einstein, the energy of an electromagnetic wave of frequency v 
is not continuously distributed over the entire wave front, but instead it 1s localized 
in small bundles (particle-like entities) called photons. The energy of each photon 
iS Epnoton = hv, where h is Planck’s constant (see Section 17.2b). 

Basically, according to Einstein, a beam of electromagnetic radiation car- 
ries energy like a beam of particles, not like a wave. Within the Einstein 
hypothesis, the intensity of the beam is a measure of the density of photons 
in the beam. Increasing the intensity without changing the frequency does 
not change the energy of the individual photons, but rather the number of 
photons per unit volume of the beam, and thus the energy density of the 
beam. 

Einstein visualized the photoelectric effect as a particle-particle collision 
in which a photon of energy hv collides with an electron in the metal and 
imparts all its energy to the electron. From conservation of energy principles, 


hv = Ex + E, (17.4) 


where E, is the energy with which the particular electron is bound to the 
metal and E;, is the kinetic energy with which that electron is ejected. From 
Eq. 17.4 it is clear that the value of E; will depend on how tightly bound a 
given electron is. The smaller E,, the larger E, will be. We can rewrite Eq. 
17.4 as 


Ex max = hv — (17.5) 


where ¢ is the minimum binding energy and is called the work function of the 
metal. (Note that @ is not an angle but a symbol for energy in this case.) 
Experimentally, Ex max is measured by determining the stopping potential Vo 
and E; max = eVo, thus Eq. 17.5 can be rewritten as 
jp" (17.6) 
€ € 

We can now explain all the experimental data presented at the beginning 
of this section. The greater the intensity I, the larger the number of photons 
that will strike the metal cathode every second. This will result in a greater 
number of photon-eléctron collisions and a subsequent increase in the number 
of electrons emitted. Thus the results of Fig. 17-5a are explained. 

Increasing the intensity increases the number of photons, not the energy 
of the individual photons. The maximum energy (and therefore Vo) should 
not depend on the number of photons (on intensity /) but rather on the energy 
of each photon, that is, on the frequency of the wave. In fact, Eq. 17.6 shows 
that V, should increase linearly with v. This is in agreement with the result 
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presented in Fig. 17-5c. The slope of the curve according to Eq. 17.6 should 
be equal to h/e = 4.1 x 10~* J-sec/C, which is the observed value. It should 
be clear that if the energy of the photons is less than the work function of 
the metal, no electrons can be ejected. That is, if 


hv<@® or y< fay, (17.7) 


no photoemission will take place. The cut-off frequency v, is accounted for. 

Finally, the emission is instantaneous because the process is not one in 
which the electrons progressively gather energy until they have enough to 
come out. It is a particle-particle collision. If just one photon with energy hv 
= ¢ collides with an electron, the latter will be immediately ejected. 


Example 17-1 


The eye is capable of detecting 10 eV of light energy. If we take as the average 
wavelength of light 6000 A, how many photons is the eye capable of detecting? 


Solution According to Einstein’s theory, the energy of a photon is given 
by hv. Using this, together with the fact (Eq. 11.2) that the product of the 
wavelength and the frequency equals the velocity of propagation of the wave, 
which in the case of light is c, that is, 


Av=c (17.8) 


we obtain 
Energy/photon = hv = 


6.63 x 10~%4 J-sec x 3 xX 10® m/sec 
6000 x 1071° m 


3.32 x 107’? J = 2.07 eV 


10 eV 


= ——_————— __ = 5 phot 
2.07 eV/photon ial 


Number of photons 


Example 17-2 


The cut-off frequency for photoemission in copper is 1.0 x 10!° Hz. What is 
the maximum kinetic energy of the photoelectrons emitted when light of 
wavelength 1000 A is shone on a copper surface? 


Solution The work function is given by Eq. 17.7 
o@ = hv, = 6.63 X 10~* J-sec x 1.0 x 10% Hz = 6.63 x 10719] 


p 
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From Eq. 17.5 
he 
Ek max = hy -— $= > — 


6.63 X 107% J-sec x 3 x 108 m/sec 
— ; x = 19 
1000 x 107! m ea 


13.26 x 10°'° J = 8.29 eV 


17.4 FURTHER EVIDENCE FOR THE PHOTON THEORY 


There exists today a large number of experimental results that confirm the 
particle nature of electromagnetic radiation. In this section, we will discuss 
qualitatively two effects that contribute further evidence to the theory. 


17.4a X-ray Production 


In 1895, Wilhelm K. Roentgen (1845-1923) discovered that when highly en- 
ergetic electrons struck a solid target, a strange (hence the name X rays) kind 
of radiation was produced. The radiation was highly penetrating: It passed 
through objects that were opaque to light. Moreover, the radiation was not 
deflected by either electric or magnetic fields, indicating that it did not consist 
of charged particles. The mysterious nature of X rays vanished a few years 
later when in 1912 Max von Laue (1879-1960) found that they could be dif- 
fracted. The diffraction of X rays by crystalline solids was discussed in Chapter 
12. These experiments proved that X rays are a form of electromagnetic ra- 
diation. Their wavelength, however, is much smaller than that of light waves: 
typically A~ 1A. 

Figure 17-7 shows a schematic of the experimental arrangement used to 
produce X rays. Electrons from a heated filament F are accelerated by a large 
potential difference V (several thousand volts). As a result, they enter the 
target T with a kinetic energy E, = eV. On striking the target, X rays are 
emitted. 


FIGURE 17-7 


Apparatus for the production of 
——— isin X rays. | 
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An analysis of the spectrum of the emitted X rays reveals several inter- 
esting features. We will concern ourselves primarily with one of them that is 
pertinent to the photon hypothesis. The emission spectrum is continuous, 
with one very important feature: a sharp, well-defined cut-off on the small 
wavelength side. These facts are shown in Fig. 17-8, which is a schematic 
plot of the intensity I of the emitted X rays versus wavelength A. The value 
of the cut-off wavelength A,nin is independent of the target material but de- 
pends on the accelerating voltage V by Eq. 17.9 

1 


) OC V (17.9) 


Using the fact that Av = c (Eq. 17.8), we can rewrite this result as, 
Vmax © V (17.10) 


Although it is not relevant to our present discussion, we should point out, 
for the sake of completeness, that in addition to the continuous spectrum 
there are several intensity peaks, as shown in Fig. 17-8, called the characteristic 
X rays. The wavelengths of the characteristic X rays are independent of the 
voltage V but depend on the material of the target. 

Let us try to understand the origin of the spectrum presented in Fig. 
17-8. When the incoming electron enters the target, it will interact with the 
atomic electrons and with the nuclei present there. The interaction with the 
atomic electrons is the process that is primarily responsible for the slowing 
down of the incident electrons. Through multiple collisions, an incident elec- 
tron is progressively slowed down and loses its energy to the target: The 
kinetic energy of the bombarding electron becomes heat. Occasionally, an 
electron-electron collision may occur, which results in a large transfer of en- 
ergy from the incident electron to an atomic electron. As a result of this 
collision, the atomic electron will be knocked out of an atom. In subsequent 
chapters we will see that the electrons in the atom occupy discrete energy 
levels. When one of these energy levels is vacated as a result of a collision, 
one of the outer electrons in the atom falls down into the vacated energy 
state and in the process gives off a photon. The energy of this photon hp, 
will be equal to the energy difference between the two atomic levels involved 
in the transition. These photons account for the characteristic X-ray peaks. 
Because, as will be shown in later chapters, the atomic energy levels are 
determined by the structure of the particular atom, we can understand why 
the frequency of the characteristic X rays depends on the target material and 
not on the energy of the incident electrons, that is, on the accelerating volt- 
age V. . | | 
The incident electrons can also interact with the nuclei in the target. This 
interaction is responsible for the continuous spectrum or, its customary name, 
bremsstrahlung (braking radiation). Let us consider an electron with energy E, 
approaching a positively charged nucleus. The electron, as a consequence of 
the coulombic attraction of the nucleus, will be deflected from its straight- 
line path; that is, it will be radially accelerated (see Fig. 17-9). Classical elec- 
tromagnetic theory predicts that an accelerated charge will radiate electro- 


FIGURE 17-8 

Intensity of X rays emitted ver- 
sus the wavelength of the X rays. 
Note that no X rays of wave- 
length less than a critical value 
Amin are emitted. 
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magnetic waves continuously and of all frequencies. Thus, from the classical view 
of radiation, it is impossible to understand why there is a wavelength cut-off 
in the emission spectrum. The cut-off can be explained rather simply by the 
photon model of electromagnetic radiation. The accelerated electron will ra- 
diate energy not continuously but in quanta of energy hv. If now we consider 
the electron of Fig. 17-9 approaching the nucleus with energy EF;, emitting 
several photons of energy hv, hv2, and so on, and finally leaving the nucleus 
with energy Ez, we can, from energy conservation considerations, write 


E, - Ee = hy + hy +::: 


We have assumed that the nucleus does not acquire any energy during the 
collision. This is a good approximation because the nucleus is much heavier 
than the electron. It is now easy to understand the existence of the cut-off 
frequency. The most energetic photon that can be produced by the interaction 
of the electron with the nucleus is the one that is produced when the electron 
loses all its energy in the emission of a single photon. In such a case E; is 
zero and therefore | 


Ex = hVmax 
But because E, = eV, we conclude that 


i SV (17.11) 
h 

Equation 17.11 predicts the experimental observation given by Eq. 17.10. Not 
only can the photon model predict the existence of the frequency cut-off and 
its proper dependence on the accelerating voltage, but the experimental re- 
sults can be used to determine the value of Planck’s constant. The value of 
h obtained in this case agrees with the values obtained by Planck in connection 
with blackbody radiation and by Einstein in the case of the photoelectric effect. 


17.4b Compton Effect 


In 1923, Arthur H. Compton (1892-1962) performed a series of experiments 
that provided a dramatic confirmation of the photon nature of electromagnetic 
radiation. A schematic of the experimental arrangement is shown in Fig. 


FIGURE 17-9 

Schematic representation of an 
electron interacting with a nu- 
cleus in the target and emitting 
X rays in the process. 


I 


Arthur Holly Compton (1892- 
1962). 
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FIGURE 17-10 

Apparatus used for the Compton effect 
experiment. Monochromatic X rays of 
wavelength A are scattered by a graphite 
target. The wavelength of the scattered 
X rays is determined for different an- 
gles 0. 


17-10. A beam of X rays of sharply defined wavelength was sent onto a 
graphite target. Compton then studied the scattered radiation to see what 
wavelengths were present in it. This was done for different angles 0, between 
the incident and the scattered beams. Whereas the incident beam consisted 
of X rays of wavelength A = 0.709 A, Compton observed that the scattered 
beam contained two intensity maxima: one at A = 0.709 A and the other at 
a A’ greater than that of the incident radiation. The value of A’ increased as 
the angle of scattering increased and reached a maximum value of 0.758 A 
for 6 = 180°. Figure 17-11 shows the intensity of the scattered radiation as a 
function of wavelength for four particular values of 6. 

To understand how these results support the photon hypothesis, let us 
first consider what classical electromagnetic theory predicts about the scat- 
tering of electromagnetic waves. When an electric field € = Gp sin (kx — wt) 
impinges on an electron in the target material, the electron will experience a 
force F = eé = e€y sin (wt — kx), see Eq. 14.1. As a result of this force, the 
electron will oscillate with a frequency equal to that of the force, that is, the 
frequency of the incident electromagnetic wave. On the other hand, according 
to classical electromagnetic radiation theory, a charged particle (the electron 
in this case) undergoing simple harmonic motion radiates electromagnetic 
waves of the same frequency as the frequency of the motion of the charged 
particle (see Section 16.8). The electron plays the role of a transfer agent: It 
absorbs energy from the incident beam and reradiates this energy at the same 
frequency in all directions. Thus, classical electromagnetic theory cannot ex- 
plain the presence of a longer wavelength (smaller frequency) in the scattered 
beam. . 

Compton explained the shift in the wavelength of the scattered beam by 
considering it to be a beam of photons, each with energy E = hv and mo- 
mentum p = hvic = h/d (see the Supplement at the end of this chapter). 
According to Compton, the photons collide with the electrons in a particle- 
particle-like collision (see Fig. 17-12). In the collision, the electron will acquire 
some momentum and energy at the expense of the photon. From consider- 


FIGURE 17-11 

Intensity of the scattered X rays 
in the Compton experiment of 
Fig. 17-10 as a function of the 
wavelength for four different an- 
gles 6. (Source: Kenneth Krane, 
Modern Physics. Copyright © 1983 
by John Wiley & Sons, Inc. Re- 
printed by permission of John 
Wiley & Sons, Inc.) 
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ation of the conservation of energy we conclude that the energy of the photon 
and, hence, its frequency will decrease: the wavelengths of the scattered 
photons will be longer than that of the incident photons. Clearly, the stronger 
the collision, the larger the angle of scattering of the photon and the greater 
the energy lost to the electron; the shift in frequency should increase with 
increasing 9. In fact, we expect that the maximum energy transfer will occur 
in the case of head-on collision, which will result in backward scattering 
(9 = 180°) of the photon. These arguments can be made quantitative by writing 
explicitly the conservation of energy and momentum equations. The solution 
of these equations, which can be found in most modern physics textbooks, 
yields the frequency and wavelength of the scattered photons as a function 
of 6. The result, originally derived by Compton, is 
h 

r A= om (1 — cos 6) (17.12) 
where X’ and A are, respectively, the wavelengths of the scattered and the 
incident photons; m is the mass of the scattering particle, the electron, and 
c is the velocity of light. An inspection of Eq. 17.12 corroborates the qualita- 


tive arguments presented earlier. In particular, A’ — A is a maximum when 
cos 6 = —1, that is, when 6 = 180°. This maximum shift in wavelength is 
therefore 
2h 2 X 6.63 x 10° * J- 
ie = a = J-sec 


mc 9.1 x 10-3! kg x 3 x 10% m/sec 


— 0.049 x 107° m = 0.049 A 


For 0 = 90° 
h 
dN’ = 2A + —(1 — cos 90°) 
mec 


0.709 x 10°'°m 


6.63 x 10° °4 J-sec 


4 
91x 10°! kg x 3 x 10° freee ” 


= 0.733 X 107! m = 0.733 A 


which agrees with the results shown in Fig. 17-11c. 

The unshifted wavelength present in the scattered beam can also be 
explained in terms of the photon model. Up to now, we have assumed that 
the photons collide with the electrons in the target material. The photons can 
also collide with the atoms in the graphite. In this case, the only change to 
be made in the calculations consists in replacing in Eq. 17.12 the mass of the 
electron by that of the carbon atom. For graphite, Matom ~ 24,000 Metectron. 
From Eq. 17.12, we can see that the shift in wavelength will be 24,000 times 
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FIGURE 17-12 
Scattering process of the X rays 


in the Compton experiment of 
Fig. 17-10. A photon collides with 
an electron in the graphite target 
in a_particle-particle-like colli- 
sion. The photon imparts some 
energy to the electron, resulting 
in a decrease in its own energy 
(and therefore in its frequency) 
and a concommitant increase in 
its wavelength. (a) Before the col- 
lision. (b) After the collision. 
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smaller, that is, 


0.049 A 


me =2x10-°A 
A) max 24,000 


(\’ - 


This is an insignificant and unobservable amount when we compare it with 
A = 0.709 A. 


Example 17-3 


X rays of wavelength A = 0.700 A are Compton-scattered by the electrons in 
a graphite target. (a) What is the wavelength of the X rays scattered at an 
angle 6 = 120°? (b) What is the kinetic energy of the scattering electrons if 
they were originally at rest? (c) What is the scattering angle of the electrons? 


Solution 


(a) From Eq. 17.12 
h 
A’ = A + — (1 — cos 120°) 
mc 


0.700 x 107!° m 


6.63 x 10~* J-sec 


(1 + 0. 
9.1 x 107%! kg x 3 x 10° m/sec Ce 


0.736 X 107! m = 0.736 A 


lI 


(b) From conservation of energy principles, the kinetic energy of the 
electron is equal to the energy lost by the photon, that is, 


Ex = hy — hy’ 


which from Eq. 17.8, Av = c, can be written as 


= 6.63 x 10~* J-sec 


| 1 1 Pe 
aoe ee ee (0) | 
3 x 10° m/sec | 5300 x 10°" m_—0.736 x 107*° =| FIGURE 17-13 | 


Example 17-3. (a) Before the col- 
1.39 x 107-'°J = 869 eV lision. (b) After the collision. 
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(c) From conservation of linear momentum principles 
p, (before the collision) = p, (after the collision) 


Writing this explicitly (see Fig. 17-13 and Eq. 17.18 in Supplement 


17-1). | 
h ’ 
oe pe cos @ — —— cos 60° 
C C 
therefore 
h ’ 
Pe COS b = + — cos 60° 
h oh 
= — + — cos 60° 
A ON 
1 0.5 
; = 6.63 x 107% J-sec | ———____ + ———_—_ 
ek a aa x 107m ~ 0.736 x 10-9 =| 
pe cos @ = 13.98 x 10~*4 kg m/sec (17.13) 


Again from conservation of momentum, 
p, (before the collision) = p, (after the collision) 
h / 
0= —p. sind + =e sin 60° 


Therefore 


/ 


h h 
p. sin b = sin 60° = +; sin 60° 


_ 6.63 x 10~** J-sec 
Pe SIN ® = 0736 x 107 m 


x (0.866) 
p. sin @ = 7.80 x 10° kg m/sec (17.14) 
Combining Eqs. 17.13 and 17.14, we obtain 


7.80 x 10-*4 kg m/sec 
Se = 8 
acid 13.98 x 10-74 kg m/sec 


db = 29.2° 
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SUPPLEMENT 17-1 


Momentum of the Photon 


From Einstein’s theory of special relativity, the total relativistic energy of a 
particle is given by 


E= Ex ss Eo 


where E;, is the kinetic energy of the particle, Ej = moc? is the rest energy (mp 
is the rest mass of the particle, that is, the mass of the particle when its 
velocity is zero), m is the relativistic mass (the mass when the particle is 
moving), and c is the velocity of light. For the photon, the rest energy and 
therefore the rest mass is zero; a photon at rest does not exist. 

We can now use Einstein’s postulate, namely, Epnoton = Av and, com- 
bining this with the expression for the total relativistic energy, Eq. 17.15, we 
can get an expression for the momentum of the photon. 


= — 2 
Ephoton = hv = mc 


Dividing by c, we obtain 


hv 
mc = — 
C 
Using Eq. 17.8, we obtain 
h 
mes = 
r 


(17.16) 


(17.17) 


We recognize the left side of Eqs. 17.16 and 17.17 as the momentum p of the 
photon, that is, the product of the mass and the velocity of the photon, 


h _ hy 


Pphoton — nN ; 


PROBLEMS 


17.1 The wavelength A,,ax for which the spectral 
radiancy, I, of a blackbody is a maximum, is given 
by Wien’s displacement law, which states: 


Amaxl = 2.898 x 107° m-K 


where T is the absolute temperature of the black- 
body. The temperature of the surface of the sun is 
roughly 5800 K. (a) What is the wavelength of the 
most intense radiation emitted by the sun? (b) In 


(17.18) 


what part of the electromagnetic spectrum is this 
radiation? 


17.2 The rate at which the sun’s energy strikes the 
earth is known as the solar constant and has a value 
of 2 cal/cm?-sec. If we assume that there is no re- 
flection by the clouds, ice caps, or such, the earth 
would absorb all this energy. The earth would be a 
blackbody radiator and would radiate all the energy 


that it receives back into space. What would be the 
equilibrium temperature of the earth if our assump- 
tion were true? 

(Answer: 1100 K.) 


17.3. The radius of a hydrogen atom is approxi- 
mately 10~?° m (1 A). Light of intensity 1.0 W/m? is 
shone on such an atom. What is the time lag for the 
photoelectric effect on the basis of the wave theory 
of light? The binding energy of the electron in the 
hydrogen atom is 13.6 eV. 

(Answer: 69.3 sec.) 


17.4 A 5000-W radio transmitter emits radiation of 
frequency v = 1100 kHz. How many photons per 
second does it emit? 


17.5 Consider a 100-W sodium vapor lamp radiat- 
ing energy uniformly in all directions. Assume that 
80% of the energy radiated is in the form of photons 
of wavelength 5890 A. (a) What is the rate of pho- 
ton emission by the lamp? (b) How far from the 
lamp will the average density of photons be 2 pho- 
tons/cm*-sec? (c) What is the photon flux (that is, 
the number of photons per unit time per unit area) 
2.0 m from the lamp? 
(Answer: (a) 2.37 xX 10” sec™', (b) 3.07 x 10’ m, 
(c) 4.71 x 10“ sec™*cm™~*.) 


17.6 Fora quick estimate of the energy of a photon 
in eV, physicists use the relation E(eV) = 12,345/A(A). 
By what percentage is this relation inaccurate? 


17.7 The basis for the creation of the latent image 
on a photographic negative is the dissociation of 
molecules of silver bromide, AgBr. The heat of dis- 
sociation of AgBr is 24 kcal/mole. Find the longest 
wavelength of light that is just able to expose the 
negative, that is, dissociate AgBr. 

(Answer: 11,900 A.) 


17.8 A metal has a work function ¢@ = 1.5 eV. (a) 
What is the stopping potential for light of wave- 
length 3000 A? (b) What is the maximum velocity of 
the emitted photoelectrons? 


17.9 Light of wavelength 1500 A falls on an alu- 
minum surface having a work function of 4.2 eV. 
(a) What is the kinetic energy of the fastest emitted 
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photoelectrons? (b) What is the stopping potential? 
(c) What is the cut-off frequency v,. for aluminum? 
(Answer: (a) 4.09 eV, (b) 4.09 V, 

(c) 1.01 x 10° Hz.) 


17.10 When light of wavelength 2000 A is incident 
on the surface of a metal, the electrons are emitted 
with a maximum kinetic energy of 2.0 eV. (a) Cal- 
culate the energy of the incident photons. (b) What 
is the work function of the metal? (c) If the incident 
light had a wavelength of 6000 A, what would be 
the stopping potential? 


17.11. The cut-off frequency for photoemission for 
a given metal is vp. What is the maximum energy of 
the emitted electrons when the metal is illuminated 
with light of frequency 3? 


17.12 When light of frequency is incident on a 
certain metal surface electrons are emitted with a 
maximum kinetic energy of 15 eV. When the fre- 
quency is reduced to 1/2, the maximum kinetic en- 
ergy is 3 eV. What is the cut-off frequency for pho- 
toemission for this metal? 

(Answer: 2.17 xX 10'° Hz.) 


17.13 Ina photoelectric effect experiment, it is found 
that when the surface of sodium metal is illuminated 
with light of wavelength A = 4200 A, the stopping 
potential Vp = 0.65 V. When the metal is illuminated 
with light of wavelength A = 3100 A, the stopping 
potential is V) = 1.69 V. Calculate Planck’s constant 
from these data. 


17.14 Not every photon striking the surface of a 
metal undergoes a collision with the electrons in the 
metal. An important quantity in the extension of the 
photoelectric effect theory is the quantum efficiency, 
namely, how many photons are required on the av- 
erage to yield one photoelectron. In a typical exper- 
iment, light of wavelength A = 4366 A is shone on 
a potassium surface. The observed yield is 8 x 10° 
C/J. How many photons are required to yield one 
photoelectron? 

(Answer: 44 photons.) 


17.15 Electrons in an X-ray tube are accelerated 
through a potential difference of 5000 V. What is the 
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maximum frequency and the minimum wavelength 
of the X rays produced? 


17.16 Alpha particles (charge + 2e) are accelerated 
by an electric potential difference of 20,000 V. The 
a particles strike a metal target and in the process 
produce X rays. Find the smallest wavelength of the 
X rays emitted by the target. 

(Answer: 0.311 A.) 
17.17 A photon of frequency v = 3 x 10" Hz is 
Compton-scattered by an electron initially at rest. 
After the collision, the electron moves in the direc- 
tion of the incident photon. (a) Find the wavelength 
of the scattered photon. (b) What is the energy of 
the scattered electron? 


17.18 X rays of wavelength 1 A are Compton-scat- 
tered by the electrons in a carbon target. (a) Calculate 
the wavelength of the X rays scattered at 90° with 
respect to the incident X rays. (b) What is the energy 
of the electrons causing the scattering? 


17.19 Ina Compton experiment the wavelength of 
the incident photon is 1.3249 A, whereas that of the 
scattered photon is 1.3461 A. (a) At what angle is 
the photon scattered? (b) At what angle is the elec- 
tron scattered? (c) What is the kinetic energy of the 
scattered electron? 

(Answer: (a) 82.7°, (b) 48.1°, (c) 148 eV.) 
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Neils Bohr’s original sketches of Hydrogen and Helium. 


CHAPTER 18 
Atomic Models 
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18.1 INTRODUCTION 


By the beginning of the twentieth century, several types of experimental data 
existed, including the photoelectric effect, that indicated that the atom con- 
tained electrons. On the other hand, the atoms were known to be electrically 
neutral. This implies that the atom carries an amount of positive charge equal 
in magnitude to the charge of its electrons. 

Even for the lightest of elements, the mass of the atom is thousands of 
times greater than that of the electron. The conclusion is obvious. Most of 
the mass of the atom resides either with the positively charged matter or with 
neutral matter. However, at that time neutral matter was neither known nor 
conceived of. 

Out of these facts, J. J. Thompson (1856-1940) proposed an atomic model. 
The positively charged matter was assumed to have a continuous, uniform, 
spherical distribution. The radius of this charge and mass distribution was 
assumed to be the known radius of the atom, (~ 10°'° m). Within this pos- 
itively charged mass, the electrons were uniformly “sprinkled” like the “plums 
in the pudding.” Although this model is in agreement with the facts just 
mentioned, it was not consistent with the known facts of electromagnetic 
theory. However, the real blow to the Thompson model came as a result of 
the work of one of Thompson’s former graduate students, E. Rutherford. 


18.2 THE RUTHERFORD MODEL 


18.2a The Scattering of a Particles 


In 1911, E. Rutherford (1871-1937) and two co-workers, H. Geiger and E. 
Marsden, did a series of experiments in which positively charged a@ (alpha) 
particles, which are actually the nuclei of the helium atoms, emitted by certain 
radioactive materials were sent through a collimator and then shot through 
thin metallic foils. (A collimator is a set of closely spaced plates with small 
aligned holes. The plates are opaque to the radiation falling on them. As a 
result of this geometric arrangement of the plates, only the radiation incident 
perpendicularly on the collimator is transmitted through the holes in the 
plates.) They studied the angular dependence of the scattered a particles, 
that is, what fraction of the a particles were scattered at a given angle 0 (see 
Fig. 18-1). 

What they found was that most of the @ particles were scattered in the 
forward direction at very small angles. A few, however, were deflected through 
large angles. For example, in one experiment a@ particles with kinetic energy 
E, = 7.7 X 10° eV were shot through a gold foil 10~° m thick. It was then 
observed that approximately 1 in 10* a particles were scattered through an 
angle 6 = 90°. When a detailed calculation using the Thompson model was 


Joseph John Thompson (1856- 
1940). 
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Ernest Rutherford (1871-1937). 


made, it was found that this fraction should not be 1 in 10* but rather 1 in 
10°°°°. This was the fatal blow to the Thompson model. 

Rutherford suggested that the experimentally observed large-angle scat- 
tering of a particles could be explained on the basis of a “nuclear planetary 
model’. This model proposes the following arrangement. All the positive 
charge and consequently almost all of the mass of the atom is concentrated 
in a very small volume at the center of the atom (r ~ 107~ '* m). The electrons 
orbit this nucleus, like planets around the sun, with the electrostatic attraction 
of the positive nucleus supplying the centripetal force. The radius of the orbits 
is of the same order of magnitude as the size of the atom, that is, r~ 107 '9 m. 
Using this model and newtonian mechanics, Rutherford was able to predict 
the experimental angular dependence for the scattered a particles. 


18.2b Difficulties of the Nuclear Planetary Model 


Although the planetary model proposed by Rutherford was able to explain 
properly the scattering of a particles, it presented new problems. Solution of 
these problems required a drastic modification of physical concepts and led 
to the first quantum mechanical model, the Bohr model, which we will ex- 
amine shortly. 

Let us consider the simplest of atoms, the hydrogen atom. An electron 
of charge g = —e rotates in a circular orbit of radius r under the electrostatic 
attraction of the nucleus (g = +e) (see Fig. 18-2). For simplicity we will 
assume that the nucleus remains stationary. To understand the difficulties of 
the model, we will derive expressions for the total energy of the atom and 
for the frequency of rotation. 


EroraL = kinetic energy + potential energy 


1 1 e 
E =n = = , 
TOTAL ~ 5 MU re (18.1) 


We can express Eroray in terms of the radius of the orbit alone, by making 
use of the fact that it is the electrostatic attraction of the nucleus that supplies 
the centripetal force that causes the circular motion of the electron. 


S radial — MAradial 
1 @e uv (18.2) 
z= = m— , 
Are, 1 r 


Multiplying both sides by (1/2) r 


(18.3) 
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FIGURE 18-1 

Schematic of Rutherford a par- 
ticles scattering experiment. The 
number of @ particles scattered 
by the atoms in the metal foil is 
measured for different values of 
the scattering angle 0. 


FIGURE 18-2 

Electron in the hydrogen atom 
moving around the positive nu- 
cleus in a circular orbit. 
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Substituting this result for 1/2 mv? in Eq. 18.1, we get 


1 1 @é 
Evora, = — > A4ne, 7 (18.4) 
0 


The frequency of rotation v can be expressed as 


_ velocity  v_ (18.5) 
- distance  2ar 


We can now express v as a function of the radius of the orbit alone. From 


Eq. 18.2, 
1 e2 1/2 
= — 18. 
: = _| ee 
We substitute for v in the expression for the frequency (Eq. 18.5) and obtain 
e2 1/2 1 
=a mr cael aa 18.7 
7 = =; is ee 


The difficulties presented by the Rutherford model with respect to clas- 
sical physics will now be explained. A particle moving in a circle with constant 
frequency w = 27v is equivalent to a particle simultaneously undergoing 
simple harmonic motion in two mutually perpendicular directions. This can 
be readily seen. From Fig. 18-2, the x and y coordinates of the electron are 
given by 


x = rcos 86 =r1rcos wt = rcos (271) 


y =rsin 6 =r sin wt = r sin (2771) 


which are expressions for simple harmonic motion in the x and y direction, 
respectively, see Eq. 10.8. 

Classical electromagnetism oredicts that a charged particle undergoing 
simple harmonic motion should radiate electromagnetic waves of the same 
frequency as the frequency of vibration of the particle. The electron moving 
in an orbit of radius r should radiate electromagnetic radiation. The frequency 
of this radiation will be given by Eq. 18.7. Because this electromagnetic ra- 
diation carries away energy, the energy of the atom must decrease as energy 
is radiated from it. From Eq. 18.4 this implies that the radius of the orbit of 
the electron will decrease continuously as the atom loses energy, until the 
electron eventually collapses into the nucleus. Thus, according to principles 
of classical physics, the Rutherford model is unstable. Moreover, because the 
frequency of the motion depends on r (see Eq. 18.7), as the radius of the orbit 
decreases continuously, the frequency of the motion, and correspondingly 
the frequency of the emitted radiation, will increase continuously. By the 
classical model the spectrum emitted by such an atom should be a continuous 
one. The experimental facts are: The hydrogen atom is not unstable and, as 
will be discussed in Section 18.3, the spectrum of hydrogen is not continuous. 
Instead, it consists of certain discrete frequencies. 
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18.3 THE SPECTRUM OF HYDROGEN 


A typical experimental arrangement used to study the electromagnetic spec- 
trum emitted by an element is shown in Fig. 18-3. A small amount of the 
element in the gaseous state is introduced into a glass tube containing two 
metallic electrodes. An electronic discharge is produced by applying a high 
voltage between the two electrodes. This discharge causes the gas in the tube 
to emit electromagnetic radiation. A sample of the radiation emitted is sent 
through a collimator and through a diffraction grating. We may consider the 
resulting interference pattern as it is projected onto a screen. The interference 
pattern of a grating is similar to that of a double slit (see section 12.4). It 
consists of a central maximum (6 = 0) that contains all the wavelengths present 
in the incident radiation. In addition, however, secondary maxima will be 
observed on the screen at locations where the condition is the same as that 
of a double-slit pattern (see Eq. 12.6) 


dsnO0=nd n-=1,2,3,°°: (18.8) 


Recall that d is the separation between slits in the double-slit condition, and 
this is also the separation between adjacent slits in the diffraction grating. If 
the incident radiation contains several wavelengths (and frequencies), the 
angles at which these interference maxima occur will be different for different 


A 
wavelengths because from Eq. 18.8, sin 6 = n 7° We can therefore separate 


and measure the wavelengths present in the radiation. 

Using methods similar to the one described, scientists in the nineteenth 
century had investigated the spectra of a large number of substances. For 
example, in the case of hydrogen, they had found that the spectrum consists 
of families of lines (wavelengths); the lines of a given family could be fitted 
to a simple empirical relation known as the Rydberg-Ritz formula, which 
yields a series of lines 


(18.9) 


Collimators 


Diffraction 
grating 


Screen 


FIGURE 18-3 
Apparatus used to measure the 


wavelengths present in the ra- 
diation emitted by the atoms of 
a gas. An electronic discharge, 
produced by a high voltage be- 
tween two electrodes in the tube 
containing the gas, causes the 
emission of electromagnetic ra- 
diation by the atoms of the gas. 
A sample of this radiation is in- 
cident on a diffraction grating. 
The angle 6 at which the inter- 
ference maxima for the different 
wavelengths occur is measured. 
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where R = 1.0967757 x 10’ m~! (R is known as the Rydberg constant) and 


n, and nj are integers. For a given m, nj = m + 1,m + 2,m, 4+ 3,.... For 
example, 
It m = 1, then n; = 2, 3, 4,-++ This family is known as the 


Lyman series 


It m = 2, then n; = 3, 4, 5,-++ This family is known as the 


Balmer series 
If Ny 


4,5, 6,°** This family is known as the 
Paschen series 


3, then 1; 


It nm = 4, then n; 


5, 6, 7,*** This family is known as the 
Brackett series red 


The Lyman series corresponds to frequencies in the ultraviolet part of the 
spectrum, the Balmer series to frequencies in the visible, and the other series 
to those in the infrared, each named after its discoverer. It should be em- 
phasized that Eq. 18.9 was a purely empirical relation; no physical explanation 


existed. — 
Example 18-1 blue 
What are the shortest and longest wavelengths of the Lyman series? violet 
Solution From Eq. 18.9 the longest wavelength corresponds to the small- ultra 

eat violet 

est value of n;, which for the Lyman series is 2. Therefore 
1 1 1 
A ~s & 7 y 
longest Photograph of the visible part of 


the spectrum emitted by hydro- 
gen (the Balmer series). 


1.0968 x 10? m~! [1 — 0.25] 
= 0.8226 x 10’ m7! 


1 
Monsest ~ 9.8996 x 107 m7 


= 1.215 x 10°7m = 1215A 
The shortest wavelength corresponds to the largest value of n, that is ~. 


1 1 


: 1.0968 x 107 m7! (3 = 4 


Achowect 


1? 00 
= 1.0968 x 10” m™! 


1 
As ortes = = 
hortest “" 1.0968 x 107 m7! 


0.9117 x 10-7 m= 912A 


18.4 THE BOHR ATOM 


18.4a Bohr’s Postulates 


To avoid the two problems encountered by the Rutherford model (instability 
and continuous spectrum) and to explain the spectral data that we have 
discussed in the previous section, Neils Bohr (1885-1962) proposed a model 
of the hydrogen atom that can be summarized in three postulates. 


Postulate 1. Instead of the infinite number of orbits, with different radii, 
which are possible in classical mechanics, the electron can take only certain 
orbits: those for which the angular momentum L takes values given by 


L = mor = nh n= 1,2,3,°°° (18.10) 


where ft is Planck’s constant divided by 27. 


Postulate 2. Contrary to the predictions of classical electromagnetic the- 
ory, an electron in one of the allowed orbits does not radiate electromagnetic 
radiation. 


Postulate 3. If an electron is initially in an allowed orbit of energy E; and 
goes into another orbit of lower energy E;, electromagnetic radiation will be 
emitted with a precise frequency given by 


E; — E 
p= ——! (18.11) 


The first postulate uses the quantization idea, that is, the fact proposed 
by Planck that the energy of a harmonic oscillator takes discrete values. Unlike 
Planck, Bohr quantizes the orbital angular momentum, not the energy. How- 
ever, we will see that by quantizing the angular momentum, the energy will 
also be quantized. The second postulate is needed to prevent the instability 
predicted by electromagnetic theory. The third postulate is basically a reaf- 
firmation of the photon concept, introduced by Einstein, coupled with the 
requirement of energy conservation. If light is made of photons of energy hv, 
then, when a photon is emitted by the atom, the atom must lose an amount 
of energy equal to the energy of the photon, that is, E; — E, = hv. 


18.4b Energy Spectrum of the Bohr Atom 


Using the first postulate, Bohr showed that the energy E of an electron in the 
hydrogen atom can have only certain values; that is, the energy spectrum is 
quantized. 

From the first postulate, Eq. 18.10 
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L = mor= mh 


Neils Bohr (1885-1962). 
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If we substitute this for the velocity v into Eq. 18.2, we have 


1 e om , h 
ATE, r2 ~ Yr e mr? (18.12) 


we can solve for the possible radii r, of the orbits. We obtain 


ty = NP (18.13) 
where 
Amh€, 
fo = Zz (18.14) 
We see that only orbits with radius 19, 479, 979, . . ., are allowed. 


We can use this result to find the predicted size of the atom. Any system, 
if left alone, will tend to go to the lowest energy state available to it. In this 
case, the lowest state corresponds to the orbit of smallest radius, that is, ro 
(see Eq. 18.4). Thus, according to the Bohr model, the normal state of the 
hydrogen atom is a circular orbit of radius ro, or (recall that 1/47€, = 9 x 10° 
N-m?/C?) 

_ 4meh* — (1.11 x 107° C*/N-m’*)(1.05 x 107~* J-sec)? 
em (1.6 x 10-19 C?? (9.1 x 10-3! kg) 


0.53 x 10° m=0.53A 


Yo 


This result is in agreement with the experimentally known size of the atom. 
The restriction on the allowed radius for the orbit leads immediately to 
the quantization of the energy spectrum. From Eq. 18.4 


7 2 ATE r ( : ) 
Substituting Eqs. 18.13 for r into Eq. 18.4 we get 
2 
a 
S7Eon To 
Using the value for ro from Eq. 18.14 we obtain 
4m 1 
a e*m 
8 eth? n 
or 
Eo 
E, = ~ 2 i= 1 2;0,°"* (18.15) 
where 
etm 
= 18.16 
Eo 8ezh2 ( ) 


We may evaluate the energy for the smallest orbit, that is, 1 = 1. Note that 


for this computation, because = 9 x 10° N-m’/C’, we may evaluate €, 


TTEg 


en 


— Bezh? 


as 8.84 x 107 !* C?/N-m?. 


_ (1.6 x 1079 C)4(9.1 x 10-3! kg) 
~ (8)(8.84 X 10712 C2/N-m?)2(6.63 x 1073 J-sec)? 


= 2.17 x 10° J = 13.56 eV 
and from Eq. 18.15 


Eo 


E,=1 = — 13.56 eV 


which is the lowest energy state of the hydrogen atom. 

Unlike the classical planetary model where all orbits and corresponding 
energies are allowed, the Bohr model restricts (quantizes) the spectrum of 
energies that the atom can have. A few of the energy levels of the hydrogen 
spectrum are shown in Fig. 18-4. The lowest state available to the atom 
corresponds to the quantum number n = 1; the energy of this level, called 
the ground state, is —13.56 eV, which is the value we have just calculated 
using Eq. 18.15. The other levels, corresponding to higher values of n, are 
called the excited states of the atom. Normally, the electron is in the ground 
state. This is due to the tendency of all physical systems to seek the lowest 
energy state available to them. 

To ionize the atom (to separate the electron from the nucleus, that is, to 
make r = ~), 13.56 eV of energy must be given to it according to the model. 
This agrees with the experimentally measured ionization energy of hydrogen. 
Once the atom has been ionized, a free electron near the nucleus can be 
captured by it; that is, an electron will fall into one of the allowed orbits. This 
capture can take place in one step—namely, the electron in one jump falls 
down to the ground state—or in a series of steps in which the electron falls 
into one or more excited states before it winds up in the ground state. In each 


0 eV 
—0.54 eV 
—0.85 eV 


a 
Houeue tl 
WHO 8 


22 Se Se 5 eV 


Energy ——_—____—» 


r=] —-———  — 13.56 eV 


-1.51 ev - 
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FIGURE 18-4 

Some of the discrete energy lev- 
els for the electron in the hydro- 
gen atom according to the Bohr 
model. An infinite number of 
closely spaced energy levels ex- 
ists between n = 5andn = %. 
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transition toward a lower energy state the electron loses a precise quantity 
of energy, called a quantum (that is, a photon is emitted). The energy, and 
consequently the frequency of this photon, will depend on the energy dif- 
ference of the levels between which the transition takes place. It should be 
clear at this point that, because the energy difference between levels is not 
continuous, the frequency spectrum of the emitted photons will not be con- 
tinuous either, but rather it will be discrete. In a typical spectroscopic exper- 
iment, such as the one described in Section 18.3, we deal with a very large 
number of atoms. These atoms are ionized by the electronic discharge and 
subsequently decay to the ground state in one or several steps. Because the 
number of atoms involved is large, all possible transitions and corresponding 
frequencies will be observed. We will now show that the wavelengths (and 
frequencies) predicted by the Bohr model for the hydrogen atom are the same 
as those observed experimentally, that is, those given by the Rydberg-Ritz 
formula of Eq. 18.9. 


18.4c Spectral Lines Predicted by the Bohr Model 


We have seen that the energy levels of the Bohr atom are given by 


Eo 
La = 7 (18.15) 
where 
e*m 
Ey = 8e2h2 (18.16) 
i, Bop Eo 
Suppose that an electron is initially in a state of energy E; = — 72 and then 


E 
— . According to the third 


makes a transition to a state of energy E, = — ee 


postulate, Eq. 18.11, a photon of frequency 
_ Fite 
oh 


© Vv 


will be emitted (see Fig. 18-5). Substituting for E; and E;, we get 


Loto 

id mJ _ Eo (1 1 
h hh \n on 

If we use the fact that Av = c (Eq. 17.8), we can obtain an expression for the 


wavelengths of the emitted photons that is quite similar to the empirical 
relation of Eq. 18.9. 


(18.17) 


(18.18) 


FIGURE 18-5 

Anelectron in the atom decaying 
from an energy level E; to a lower 
energy level E, emits a photon of 
energy hv. 


ae R See 18.19 
\ Bohr n? n2 ( . ) 
Eo . 
where Rgonr 1S a constant equal to hc and is numerically equal to 
Eo e*m 
R ohr ~ 7. = 
aa he =8es¢h>c 


(1.6 x 10719 Ct x (9.1 x 10731 kg) 
8 x (8.84 x 107 1? C?/N-m?)? x (6.63 x 107 * J-sec)? x (3 x 108 m/sec) 


1.09740 x 107m“! 


We see that Eq. 18.19 is similar to the Rydberg-Ritz formula (Eq. 18.9) and 
the constant Rgon, is almost identical to the Rydberg constant R. (The difference 
between Rgonr and the Rydberg constant is further reduced if the motion of 
the proton is considered.) All that remains to be shown is that the integers 
in both equations coincide. If we consider transitions where the final state is 
in the ground state, then n,;in Eq. 18.19 is equal to 1. Transitions to the ground 
state can take place when the atom is initially in the first excited state (n; = 
2) or the second excited state (n; = 3) or the third excited state (n; = 4), and 
so on. Thus when n; = 1, n; can be 2, 3, 4, .. . . These are the values taken 
by the integers n, and n; of the Rydberg-Ritz formula for the Lyman series. 
similarly, if we consider transitions to the first excited state (n; = 2), then the 
initial state can be the second excited state (n; = 3), the third excited state (n; 
= 4), and so on. The Balmer series corresponds to transitions to the first 
excited state from higher energy states. The other families of wavelengths 
can be explained with similar arguments. Some of the transitions that account 
for each family of lines are shown in Fig. 18-6. 


ae 
| 
aaa ee 
anes 

“U0 

a 

w 

o) 

> 

) 

3 

N 


THE BOHR ATOM 273 


FIGURE 18-6 

Transitions responsible for some 
of the wavelengths in each of the 
spectral series of the hydrogen 
spectrum. Additional transitions 
have been omitted for the sake 
of clarity in the diagram. 
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Example 18-2 


After being excited, the electron of a hydrogen atom eventually falls back to 
the ground state. This can take place in one jump or in a series of jumps, the 
electron falling into lower excited states before it ends up in the ground state. 
Consider a hydrogen atom that has been raised to the second excited state, 
that is, n = 3. Calculate the different photon energies that may be emitted 
as the atom returns to the ground state. 


Solution The possible transitions are shown in Fig. 18-7. Using the results 
of Eqs. 18.11, 18.15, and 18.16, we can write 


1 1 
Es - E: = Eo (4-5) 


1 
13.56 eV (1 = 3 = 12.05 eV 


1 1 
tes = Es ~ Ea = Eo (5 ) 


hv3, 


22 3 
= 13.56 eV ee = 1.88 eV 
~ "N47 9) > 
1 1 
hms = Ea ~ Ex = Eo (3 - 55) 
1 
= 13.56 eV € _ _ = 10.17 eV 


18.5 THE FRANCK-HERTZ EXPERIMENT 


By postulating the existence of a discrete spectrum of energy levels through 
the angular momentum condition, Eq. 18.10, Bohr was able to predict the 
correct electromagnetic spectrum for hydrogen. The existence of discrete en- 
ergy levels in atoms was demonstrated directly by James Franck and Gustav 
Hertz in 1914. 


FIGURE 18-7 
FIGURE 18-7 
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The experiment that they performed is not very complicated. In fact, 
today it is a standard experiment in an undergraduate modern physics lab- 
oratory. But in spite of its simplicity, the importance can be appreciated by 
the fact that Franck and Hertz were awarded the Nobel Prize in Physics in 
1925. 

A schematic of the experimental set-up is shown in Fig. 18-8. The essential 
part of the apparatus consists of a tube containing vapor of the element under 
study. The tube contains three electrodes: a filament (F) that provides elec- 
trons when heated, a plate (P), and a grid (G). A grid is a charged screen 
that can attract or repel electrons but, because most of it is open space, the 
majority of the electrons pass through it. A variable accelerating voltage Vo 
is applied between the filament and the grid. As a consequence of this po- 
tential difference, the electrons will reach the grid (in the absence of collisions) 
with a kinetic energy E, = eV. After reaching the grid the majority of these 
electrons will go through the holes in the grid, be collected by the plate P, 
and contribute to the plate current i, which can be measured by the ammeter A. 
A small, constant retarding voltage V, (~ 1 V) is applied between the plate 
and the grid. If V, > Vo, the electrons will be turned back before they can 
reach the plate and they will not contribute to the current measured by A. 
But even if V, < Vo, the electrons will not be able to reach the plate if they 
lose enough kinetic energy through collisions with the atoms in the tube as 
they travel between the filament and the grid. 

In the absence of any vapor, that is, a vacuum, the 1-Vo characteristics 
are those of a typical vacuum tube. This dependence is shown by the dashed 
line of Fig. 18-9. We should note that a vacuum tube is an example of a 
nonohmic device, that is, as indicated in Section 15.4, a device in which the 
current does not vary linearly with voltage; therefore, a straight line behavior 
between i and Vg is not to be expected. If vapor of some element is present 
in the tube, one observes a series of fairly sudden dips superimposed on the 
monotonic vacuum curve. The solid curve in Fig. 18-9 shows this effect for 
the case where mercury vapor is present in the tube. 

To interpret the results of Fig. 18-9, let us keep in mind the fact that 
electrons will not contribute to the plate current, even though V, < Vo, if they 
lose enough kinetic energy through collisions with the atoms in the tube. The 


FIGURE 18-8 

Schematic of the apparatus used 
in the Franck-Hertz experiment 
to show the quantization of the 
internal energy of atoms. 


FIGURE 18-9 


Dependence of the plate current 
i (measured by the ammeter A in 
the apparatus of Fig. 18-8) on the 
accelerating voltage V,. 
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fact that there is no drop in the current until Vy = 4.9 V indicates that the 
electrons do not lose energy through collisions until they have 4.9 eV of kinetic 
energy. 

Let us digress for a moment and consider what kind of collisions may 
occur. When the electron collides with a heavy atom such as mercury (Hg), 
there are two possible kinds of collisions: elastic and inelastic. In the case of 
elastic collisions, the total energy of both particles before and after the collision 
is the same. But when one particle (the Hg atom) is much heavier than the 
other (the electron), the requirement that the total energy and momentum be 
conserved leads to the fact that the kinetic energy of the light particle is hardly 
changed, its velocity is simply reversed. By writing down the equations of 
conservation of momentum and energy, it was shown (see Eq. 6.16) that if 
a small particle of mass m and velocity v strikes head-on a large stationary 
particle of mass M, then the velocity v’ of the small particle after the collision 
is 


(=, 6.16 
 mt+M ky) 
Thus if m << M, then v’ ~ — v. The kinetic energy of m does not change 


appreciably. In the case of an elastic collision, the electrons may momentarily 
be deflected, but because they keep their energy, they will eventually be able 
to overcome the small retarding voltage V, and contribute to the plate current 
1. No drop in the current will be caused by this kind of collision. 

In the case of inelastic collisions, the external kinetic energy of the col- 
liding particles becomes internal energy (that is, the particles, in this case the 
Hg atoms, absorb energy and are excited). When this type of collision occurs, 
some of the electrons may lose enough energy to be prevented from reaching 
the plate by the retarding potential V,. If the Hg atom can have a continuous 
distribution of internal energy states, the transfer of kinetic energy from the 
bombarding electrons to the Hg atom could and should occur regardless of 
the energy of the electrons, that is, the drop in the current should occur for 
any value of Vo. The fact that the drop occurs only when Vy = 4.9 V (and 
therefore the E; of the electrons is 4.9 eV) indicates that the first excited state 
of Hg (the smallest amount of energy that the Hg can absorb) is 4.9 eV above 
the ground state. As Vo increases beyond the 4.9 V, the current begins to 
increase again because, although the electrons can and do collide inelastically 
and lose 4.9 eV of energy, they still have enough energy remaining to over- 
come the small retarding voltage V,, When Vo = 2 X 4.9Vor3 x 4.9 V, or 
so on, dips in the current occur again because now the electrons can undergo 
two, three, or more inelastic collisions with the Hg atom; in each collision 
they lose 4.9 eV. 

This interpretation is corroborated by the electromagnetic radiation emit- 
ted by Hg atoms. There should be a spectral line whose frequency is given 
by hv = 4.9 eV or A = 2530 A. Such a wavelength is found in the spectrum 
of Hg. An energy diagram for Hg reconstructed from spectral data is shown 
in Fig. 18-10. The energy difference AE between the first excited state and 


—— — 
FIGURE 18-10 


Some of the atomic energy levels 
of a mercury atom. The energy 
spectrum has been reconstructed 
from the observed wavelengths 
emitted by mercury atoms. Note 
that the energy difference be- 
tween the ground state and the 
first excited state is 10.4 eV — 
5.5 eV = 4.9 eV. Not coinciden- 
tally, the dips in the current i in 
the Franck-Hertz experiment oc- 
cur for values of V, that are in- 
tegral multiples of 4.9 V (see Fig. 
18-9). 
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the ground state is AE = 10.4 eV — 5.5 eV = 4.9 eV. We may ask why do 
we not observe a dip when Vp = 6.7 V? This would correspond to a collision 
where the Hg atom is excited from the ground state to the second excited 
state. The amount of energy needed to produce this transition is AE = 10.4 
eV — 3.7 eV = 6.7 eV. The answer to the question is that such a dip can be 
observed. However, to observe it, the sensitivity of the experiment must be 
increased. The reason is that the electrons can and do lose energy as soon as 
they have 4.9 eV. As a result, the number of electrons that are able to avoid 
the 4.9 eV collision and gather 6.7 eV is a small fraction of the total number 
of incident electrons. This reduces the magnitude of the 6.7 eV dip and makes 


it more difficult to observe. 


PROBLEMS 


18.1 Calculate the shortest and the longest wave- 
length of the Balmer series of hydrogen. 


18.2 What are (a) the energy, (b) the momentum, 
and (c) the wavelength of the photon that is emitted 
when a hydrogen atom undergoes a transition from 
the state n = 3 ton = 1? (The momentum of the 

photon is given by hv/c). 
(Answer: (a) 12.07 eV, (b) 6.44 x 107°” kg-m/sec, 
(c) 1030 A.) 


18.3 The shortest wavelength of the Paschen series 
from hydrogen is 8204 A. From this fact, calculate 
the Rydberg constant. 


18.4 When an atom emits a photon it must recoil 
with a momentum equal but in opposite direction 
to that of the photon. (a) Show that the recoil energy 
of an atom when a transition from one energy state 
to another takes place is given by 


— (h v)? 


E, = 
2 mc? 


where m is the mass of the atom and hv is the energy 
of the photon. (b) How does E, compare with hv in 
the case of a hydrogen atom? 

(Answer: (b) E, ~ 107° hv.) 


18.5 A hydrogen atom is excited froma staten = 1 
to a state n = 4. (a) Calculate and display on an 
energy-level diagram the different photon energies 
that may be emitted as the atom returns to then = 1 


state. (b) Calculate the recoil speed of the hydrogen 
atom, assumed initially at rest, if it makes the tran- 
sition from n = 4ton = linasingle quantum jump. 


18.6 Calculate the ionization energy of the hydro- 
gen atom from the following data. The wavelength 
of the Balmer series limit (n = © ton = 2) is 3645 A 
and the wavelength of the first line of the Lyman 
series (1 = 2ton = 1) is 1215 A. 


18.7. Ahydrogen atom initially at rest in its ground 
state absorbs a 100 eV photon and the ejected pho- 
toelectron moves in the same direction as the initial 
direction of the photon. What is the recoil velocity 
of the proton? 


18.8 Monochromatic light is shone on a hydrogen 
gas with all its atoms in the ground state. As a con- 
sequence, the gas emits radiation containing six dif- 
ferent frequencies. What is the minimum frequency 


of the incident light? 
(Answer: 3.07 X 10'° Hz.) 


18.9 The average lifetime of an electron in an ex- 
cited state of an atom is 107° sec. How many rev- 
olutions does an electron make in the n = 2 state of 
hydrogen before dropping down to the n = 1 state? 

(Answer: 8.3 X 10° rev.) 


18.19 Compare the frequency of revolution in the 
hydrogen atom with the frequency of the emitted 
photon for a transition in which An = 1 when the 
initial state is (a) n = 10, (b) n = 100, (c) n = 1000. 
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18.11 A particle called the mu meson is emitted 
when a nucleus is struck by a high-energy particle 
from an accelerator. The mu meson has the same 
charge as an electron, but its mass is 207 times larger. 
Sometimes this mu meson is captured by a proton 
to form muonic hydrogen. What will be (a) the ra- 
dius of the first Bohr orbit of the mu meson? (b) the 
ground state energy, and (c) the wavelength emitted 
in the transition n = 2 ton = 1? Neglect the motion 
of the proton. 
(Answer: (a) 2.6 x 107° A, (b) —2.81 x 10° eV, 
(c) 5.87 A.) 


18.12 Hydrogen gas is diatomic, that is, Hp. 
But suppose that we have monoatomic hydrogen. 
(a) What energy is needed to raise the electron from 
its ground state n = 1 to the first excited state n = 
2? (b) At what temperature will the average kinetic 
energy of a monoatomic hydrogen gas be equal to 
that energy? 

(Answer: (a) 10.19 eV, (b) 7.87 x 10? K.) 


18.13 Consider transitions in hydrogen atoms in 
which An = 1. (a) Show that for very large n the 
energy change, AE ~ a*(mc*/n°), where m is the 
mass of the electron, c is the velocity of light, 


1 2 
and a = 27 ( =) . (b) a is named the fine 


ATE he 
structure constant; show that it has a numerical value 
of 1/137 and that it is dimensionless. 


18.14 Suppose hydrogen atoms are placed in a me- 
dium in which the dielectric constant k = 6. 
(a) Calculate the radius of the first orbit and the 
ionization energy. (b) Suppose k = 12; calculate the 
radius and ionization energy. (Hint: When two charges 
are placed in a medium of dielectric constant k, the 
constant 1/47ré, in Coulomb’s force law is replaced 
by 1/47ké ). | 

(Answer: (a) 3.18 A, 0.38 eV, (b) 6.36 A, 0.094 eV.) 


18.15 The emission spectrum of a given system 
consists of six different frequencies: 1.5 x 10'° Hz, 


2.7 x 10’ Hz, 3.5 x 10° Hz, 4.2 x 10° Hz, 6.2 x 
10'° Hz, and 7.7 x 10'° Hz. Choose the ground-state 
energy to be E = 0. (a) What are the energies of the 
excited states of the system? (b) Show which tran- 
sitions give rise to each of the six frequencies. (As- 
sume that transitions between all available energy 
levels are possible.) 


18.16 Two hydrogen atoms having the same en- 
ergy undergo an inelastic head-on collision. After 
the collision, the atoms come essentially to rest and 
two photons are emitted. The wavelengths of the 
emitted photons are 3667 A and 1220 A. What was 
the velocity of the hydrogen atoms? Note that mo- 
mentum will be conserved if the hydrogen atoms 
keep a very small fraction of the kinetic energy (see 
problems 18.4 and 18.5). 

(Answer: 3.6 X 10% m/sec.) 


18.17 Assume that the orbit of the moon around 
the earth is a circular orbit of radius 3.8 x 10° m. 
The mass of the moon is 6 X 10” kg, and its orbital 
velocity is 10° m/sec. (a) If the first Bohr postulate 
applies to the motion of the moon around the earth, 
find the quantum number n of the orbit. (b) Use 
Newton’s law of gravitation and Bohr’s postulates 
to show that the allowed orbital radius of the moon 
is r, = nro where fo is a constant. (c) What is a 
fractional difference between the assumed radius of 
3.8 x 10° m and the next allowed orbital radius? 
(Answer: (a) 2.2 x 10%, (c) ~ 10° ®.) 


18.18 Find the smallest electric potential difference 
through which an electron must be accelerated to 
enable it to excite a hydrogen atom out of its ground 
state. 


18.19 The ground-state and the first excited-state 
energies of potassium atoms are —4.3 eV and —2.7 
eV, respectively. If we use potassium vapor in the 
Franck-Hertz experiment, at what voltages would 
we see drops in the plot of current versus voltage? 


Photograph of one of the original tubes used by Davisson and Germer in their electron 
diffraction experiments. 
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19.1 INTRODUCTION 


In this chapter we will present two principles that form the cornerstones of 
quantum mechanics: de Broglie’s hypothesis and the uncertainty principle. 

In Chapter 17 we saw that experiments such as the photoelectric effect 
and the Compton effect could be explained by looking at electromagnetic 
radiation as being made up of particle-like entities called photons. On the 
other hand, the experiments of Young and others, demonstrating the inter- 
ference effects of light, are as valid today as when they were originally per- 
formed; these interference and diffraction experiments require a wave model. 
We must therefore consider electromagnetic radiation as having dual prop- 
erties: It is both a wave and a particle. Although the wave and particle prop- 
erties seem to be irreconcilable, they are not. These two models are, in the 
words of Neils Bohr, complementary. The principle of complementarity is dis- 
cussed at the end of this chapter. 


19.2 DE BROGLIE’S HYPOTHESIS 
AND ITS EXPERIMENTAL VERIFICATION 


In 1925, Louis de Broglie (1892- ) assumed the existence of a natural 
symmetry in nature and proposed that the dual character exhibited by photons 
should equally apply to all material particles. Accordingly, he hypothesized: 
The motion of a particle is governed by the wave propagation properties of a “‘pilot”’ 
wave (also called matter wave). The wavelength r and the frequency v of the pilot 
wave associated with a particle of momentum p and energy E are 


A= : and v= — (19.1) 


To prove de Broglie’s hypothesis we have to show experimentally that 
a beam of particles exhibits wave-like properties, such as interference and 
diffraction. 

Let us look at the feasibility of such experiments. Consider a bullet of 
mass m = 0.1 kg moving with a velocity v = 10° m/sec. According to 
de Broglie, the wavelength of the pilot wave will be, 

~34 
pee ee i ee 
p 0.1 kg x 10° m/sec 
which is 26 orders of magnitude smaller than the diameter of an atom. As 
we saw in Chapter 12, in order to observe interference effects, the size of the 
diffracting slit d must be comparable to the wavelength of the wave. It is 
clear, therefore, that we cannot hope to see a diffraction pattern when a beam 
of macroscopic objects goes through a hole. The smallest slit or diffraction 
grating available in nature is a crystalline solid where d is a few angstroms. 
If we are going to have any success in our attempts to observe the wave 


Louis de Broglie (1892- _). 
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FIGURE 19-1 
Schematic of the apparatus used 
in the Davisson-Germer experi- 
ment. Electrons from a heated 
filament are accelerated through 
a variable potential difference V. 
They strike a nickel crystal, and 
the number of electrons scat- 
tered at a given angle ¢ is mea- 
sured as a function of the accel- 
erating voltage. The experiment 
can be repeated for different val- 
ues of the angle @. 


DE BROGLIE’S HYPOTHESIS AND ITS EXPERIMENTAL VERIFICATION 281 


nature of material particles, we must have a beam with a wavelength not h 
much smaller than a few angstroms. From Eq. 19.1 this means that the mo- 
mentum of the particles has to be relatively small, and the most practical way 
of achieving this condition is to choose a particle of small mass such as the E 
electron. , a> 

In 1927, C. J. Davisson and L. H. Germer performed such an experiment. 
The experimental setup used is shown in Fig. 19-1. Although not shown, the 
electron path must be through a vacuum. Electrons from a heated filament 
are accelerated by a variable voltage V. These electrons emerge through a 
small hole in the anode with a kinetic energy E, = eV. The beam then strikes 
a single crystal of nickel, and the intensity of the scattered beam can be 
measured for different angles @ and various voltages V. Although these in- 
vestigators used a nickel crystal, almost any crystalline solid will produce the 
same results. 

If the propagation of the beam is particle-like, we may expect that the 
intensity I of the scattered beam will have a smooth monotonic dependence 
of both ¢ and V because only elastic collisions with the atoms of the crystal 
are involved. Thus, if we consider the plane of atoms in the crystal to act like 
a wall, the incident particles will carom off so that the incoming angle (angle 
of incidence) is equal to the outgoing angle (angle of reflection) (see Fig. 
19-2a). Then for any incoming angle we will observe the same number of 
particles caroming off at a similar angle regardless of their velocity and thus 
of the potential V through which they have been accelerated. On the other 
hand, if the incoming electrons are not particles but are actually waves, we 


would expect a diffraction effect like the one observed with X rays when the Sos : = | 
Bragg scattering condition discussed in Chapter 12 is satisfied, that is, when ae 
nd = 2d sin 0 (12.11) | ~ t 


From X-ray studies the physicists knew that the spacing between one set of @) 
atomic planes of nickel is 0.91 A. For X rays of wavelength 1.65 A the angle 
of incidence to the plane of atoms that satisfies Eq. 12.11 for first-order dif- 
fraction (n = 1) is 


1.65 A 
= a ge = U7 
sin@=7xX091A 
6 = 65° —_ 
It is seen in Fig. 19-2a that the angle ¢ between the two beams is defined) _ ~— 


as 20 + @ = 180°. The Bragg angle @ is 65° and therefore ¢ = 50°. This was FIGURE 1920 
| , :; (a) Electrons being scattered by 
the angle selected for the experimental arrangement in Fig. 19-1. Now h < vl Phe chek 
ie” 9.1, states that the wavelength of a particle is mubdaceaa ae puame tues aban 
de Broglie hypothesis, Eq. 19.1, 8 P crystal. (b) Intensity of the scat- 
a function of its momentum p. The momentum is related to the kinetic energy  te:eq beam, that is, number of 
of the particle and the kinetic energy, as shown in section 14.4, depends on _ gcattered electrons, as a function 
its accelerating voltage through the relation 1/2 mv? = eV. The experimental of the accelerating voltage V for 
results of Davisson and Germer are shown in Fig. 19-2b. The angle @ between a fixed value of the angle ¢ = 


the incoming and the scattered beams of electrons was set at 50°. The accel- 50°. 
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erating voltage was increased, and the intensity I of the reflected beam was 
measured for different values of this voltage. A maximum was observed when 
the voltage was V = 54 V. Davisson and Germer then calculated the wave- 
length that the electrons would have from the de Broglie hypothesis, Eq. 19.1. 
The momentum p can be obtained from the kinetic energy E,. By definition 


1 
Ex = 5 nv" 


Therefore 


2 es V2m E; 


but E, = eV, and it follows that 


p = V2 meV 


Substituting this relation for the momentum into the de Broglie equation, Eq. 
19.1, 


h h 
7 p — V2 meV 


they obtained a wavelength for the electrons of 


he 6.63 X 107% J-sec 
(2x 9.1 xX 10-2 kg x 1.6 x 107% C x 54 V)!? 


= 167 x 10°°m=1.67A 


The agreement between the two wavelengths—that is, the predicted wave- 
length for electrons of this energy and the wavelength of 1.65 A for X rays 
to be diffracted from nickel—is excellent. Thus the experiment confirms 
de Broglie’s hypothesis that particles have wave properties and that the wave- 
length of the wave is given by Eq. 19.1. 

Since the original experiment by Davisson and Germer, other people 
have found convincing evidence for the wave properties of material particles. 
Estermann, Stern, and Frisch have shown inteference effects with beams of 
hydrogen and helium atoms. Fermi, Marshall, and Zinn have done similar 
experiments with neutrons. Today, the existence of matter waves rests on 
solid experimental foundations, and electron and neutron diffraction mea- 
surements are standard techniques for the study of crystalline materials. The 
design and construction of the electron microscope are based on the principle 
that electrons propagate as waves. 


Example 19-1 


A beam of monochromatic neutrons is incident on a KCl crystal with lattice 
spacing of 3.14 A. The first-order diffraction maximum is observed when the 
angle 6 between the incident beam and the atomic planes is 37°. What is the 
kinetic energy of the neutrons? 


Solution Using the Bragg condition (Eq. 12.11), with n = 1, we can find 
the wavelength of the neutron beam. 
A = 2d sin 0 
2 x 3.14 A sin 37° = 3.78.A 


From de Broglie’s hypothesis (Eq. 19.1), the momentum of the neutrons is 


h 6.63 x 10~** J-sec 7 
p= n = 3.78 x 10-9 m_ = 1.75 x 10 en kg-m/sec 


The kinetic energy of the neutrons will be 


1 p? 
E — = 2—_ 
RO om 
1.75 x 10724 ke-m/sec)” 
g 


= = 9.21 X 10° 
2 x 1.67 x 10°?’ kg J 


5.75 x 10-° eV 


19.3 NATURE OF THE WAVE 


If we accept the hypothesis that a particle is guided by a wave as it moves 
through space, what is the nature of this wave? What is waving? To under- 
stand the accepted interpretation of the nature of the wave, let us first consider 
the case of electromagnetic radiation. 

Classically, electromagnetic (EM) radiation is an electric and a magnetic 
field whose behavior in time and space is comparable to that of a wave in a 
string (see Section 16.8). We know, however, that this model of EM radiation 
cannot explain the photoelectric effect or the Compton effect. To explain these 
effects, we need a particle model. The fact remains that when a beam of 
photons passes through a double slit, an interference pattern is observed; this 
is a unique wave property. The question is therefore: Within the photon picture, 
what is the nature of the wave that governs the propagation of photons? 

The link between the classical (wave) picture and the photon (particle) 
picture is the intensity I. The intensity has meaning in both models; it is 
simply the energy per unit time, per unit area of the beam (see Section 11.4). 
The difference is the way the energy is carried by the beam: localized (in the 
particle model) versus continuous (in the wave model). We saw in Chapter 
11, Eq. 11.20, that the intensity of a wave is proportional to the square of the 
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Electron diffraction pattern of a 
TizNdjoO.9 crystal. As in the 
X-ray diffraction on page 172, 
each bright dot is produced by a 
set of atomic planes that satisfies 
the Bragg condition for the de 
Broglie wavelength of the elec- 
trons. 
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amplitude of the wave, that is, 
I « (amplitude of the wave) (Wave model) 


On the other hand, it is relatively easy to see that, in the photon model, the 
intensity is given by 
I = (c)(N)(hv) (Particle model) 


where N is the number of photons per unit volume of the beam, hv the energy 
of each photon, and c is the velocity of the beam, which is the velocity of 
light. 

Classically, one can increase the intensity of the EM radiation by increas- 
ing the amplitude of the wave. In the photon scheme, the intensity of a 
monochromatic beam is increased by increasing the number of photons per 
unit volume, N. This comparison tells us that the number of photons in the 
beam must be proportional to the square of the amplitude of the wave. 


N « (amplitude of the wave)’ (19.2) 


When we talk about the density of photons, we should not think of N as an 
exact number in the sense that one finds exactly the same number of photons 
in a given volume of the beam. There are fluctuations. We do not notice these 
fluctuations with ordinary intensities, such as the intensity from a nearby 
lamp, because the number of photons is very large. But if one were to shine 
light on a screen with an intensity corresponding to 100 photons per second 
per meter”, one would make the following observations: 


1. The 100 photons per second is an average value: one second, we may have 
105 photons landing on the screen, another 92, yet another 97, and so 
forth. 


2. One cannot predict where the next photon striking the screen will land. 
We will find that if we wait long enough, the same number of photons 
will land on one particular section of the screen as on any other. 


The reason for this randomness in the behavior of photons is the process 
of photon emission itself. It is a statistical process in which a large number 
of atoms, after being excited or ionized, eventually decay (with a half-life that 
is typically 10~® sec) to the ground state while emitting photons. When we 
say that the half-life of the excited state is 10~° sec, we mean that half of the 
excited atoms will decay in 10~® sec. There is no way of predicting whether 
a particular atom will decay before or after 10~° sec. 

As a result of this randomness we cannot state with certainty whether 
or not we will find a photon within a small volume of the beam: We can 
nevertheless talk about the probability of finding a photon within such a 
volume. Suppose that N = 1 photon/m?. Because this photon can be anywhere 
in the m°, the probability of finding the photon within a AV = 1 cm? will be 
1 in 10° (1 m? = 10° cm?). If N is 10 photons/m”, the probability of finding a 
photon within the same AV will be 10 times greater. We see that the probability 
of finding a photon, the probability density, within a given volume of the 


Max Born (1882-1970). 


beam is proportional to the density of the photons N. But from Eq. 19.2, 
N « (amplitude of the wave)’; we therefore conclude, as did Einstein, that 
the probability of finding a photon within a given volume of the beam is 
proportional to the square of the amplitude of the wave associated with the 
beam. Within the photon model the wave is not an electromagnetic field; it is rep- 
resented by a mathematical function that measures the photon probability density. 

In 1926, Max Born (1882-1970) extended this probabilistic interpretation 
to the matter waves proposed by de Broglie for material particles. According 
to this interpretation, accepted by most scientists today, ‘‘what is waving” is 
nothing physical like the particles in a string or the water molecules in a pond 
when the ripples go by: The guiding wave is represented by a mathematical 
function, ¢ (r,t), called a wavefunction. The physical significance of the wave- 
function is the following: At some instant t, a measurement is made to locate the 
particle associated with the wavefunction ws. The probability P(r,t) dV that the particle 
will be found within a small volume dV centered around a point with position vector 
r (with respect to a prechosen set of coordinates) is equal to |p|? dV, that is, 


P(r,t) dV = |W? dV (19.3) 


We should note that the probability of finding the particle somewhere in 
space must be unity. Therefore, the wavefunction is must be normalized; that 
is, it must satisfy the condition 


I. W2 dV =1 (19.4) 


where the integration is carried over all space. 

The methods of quantum mechanics consist in first finding the wave- 
function associated with a particle or a system of particles. As we will see in 
Chapter 20, all the information about the physical properties of the system 
can be obtained from it. 

It is interesting to note that among the critics of this probabilistic inter- 
pretation of matter waves we find Einstein (‘“God does not play dice with the 
Universe’) and de Broglie. The reason for the opposition is that the inter- 
pretation introduced by Born goes against one of the fundamental philo- 
sophical tenets of classical physics: the concept of ‘determinism.’ This brings 
us to one of the fundamental principles of quantum mechanics: the uncertainty 
principle, first enunciated in 1927 by Werner Heisenberg (1901-1976). 


19.4 THE UNCERTAINTY PRINCIPLE 


The use of probability functions to describe the behavior of a system is not 
foreign to classical physics. In statistical mechanics, where one often deals 
with a very large number of particles (for example, the study of a gas), we 
do not deal with the individual velocities, energies, or such, of the particles. 
We talk about the average value of these quantities and we deal with velocity 
and energy distribution functions, that is, the number or fraction of a group 
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P(r,t) dV = |\y* dV 


[ Wir dV = 1 
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at each velocity or energy (Chapter 9). In classical physics this recourse to 
probability functions is the result of practical necessity. If we want to study 
the behavior of 10”? molecules in a gas we could, in principle, write down and 
solve 10*° coupled differential equations for the coordinates of each molecule. 
But this would be beyond the most advanced computer and would not yield 
any useful information. In quantum mechanics, the use of probability func- 
tions is one of fundamental necessity: We have no choice. Even for the sim- 
plest of systems, the one particle system, we cannot determine its position 
as a function of time, x(t). 

What is the root of this basic disagreement? According to classical physics, 
if you know the forces acting on a particle and you know the initial conditions 
(that is, the position x9 and the velocity vo at some initial time t = 0), then 
by solving Newton’s laws of motion, the position x(t) of the particle at any 
other time can be predicted or determined (hence “‘determinism’’). The as- 
sumption here is that the initial position and velocity can be determined 
exactly. Classical physics assumes that given the proper measuring instru- 
ment, this can be done. The answer of quantum mechanics is different and 
was presented in the famous uncertainty principle by Werner Heisenberg: 
An experiment cannot simultaneously determine a component of the momentum of a 
particle, for example p, and the exact value of the corresponding coordinate x. The 
best one can do is 


Ap, Ax =h (19.5) 


where A represents the uncertainty of measurement. 
There are three points that we should note about the uncertainty prin- 
ciple. 


1. The limitations imposed by the uncertainty principle have nothing to do 
with the quality of the experimental instrument. With the ideal instrument, 
the best one can do is given by Eq. 19.5. 


2. The uncertainty principle does not say that one cannot determine the 
position or the momentum exactly. However, if Ax = 0, then the uncer- 
tainty in the momentum will be infinite, and vice versa. 


3. Later in this chapter we will show that the uncertainty principle is a direct 
consequence of de Broglie’s hypothesis, which, as we have seen, is con- 
firmed by experiment. Thus the uncertainty principle is based on experi- 
ment. 


In classical physics we assume that the position and the velocity of a 
particle can be determined with any desired degree of accuracy. Because of 
the smallness of Planck’s constant, this assumption is a very good approxi- 
mation when dealing with macroscopic objects; the approximation breaks 
down in what Born called the ‘restless universe” of the atom. Let us see what 
the implications of the uncertainty principle are when we try to predict the 
trajectory (the future) of the moon around the earth. The mass of the moon 
is 6 x 10” kg, and its average orbital velocity is 10° m/sec. Suppose that we 
are able to determine the position of the moon with an uncertainty Ax = 


Werner Heisenberg (1901-1976). 


Ap, Ax = h 
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10~° m, a very small uncertainty in this case. What is the limit of our ability 
to determine its velocity simultaneously? From Eq. 19.5 


h — 10~** J-sec 


ae 10~ 7° kg m/sec 


Ap, = 


Because by definition p, = mv,, it follows that 


Ap, _ 107** kg m/sec 


> =~ 19~50 
- 6 x 102 kg QO~° m/sec 


KO. 


This is an insignificant error when we compare it with the measured value of 
v = 10° msec. 

Now, let us consider an electron in the hydrogen atom. We saw in Chap- 
ter 18 that the smallest radius of a Bohr orbit is approximately 0.5 x 
10~1° m. Suppose that, in the hope of being able to determine p, as accurately 
as possible, we decide to give up all knowledge about the coordinates of the 
electron, that is, let Ax ~ 107 1° m. (The electron can be anywhere in the orbit, 
and therefore x can take any value between 0.5 x 107'° m and —0.5 x 
10~!° m.) With what accuracy can the x component of p be determined? From 
Eg. 19.5 


h 107 34 J-sec 


ia 4910 = 10°** kg m/sec 


Ap, = 
To understand the significance of this uncertainty, let us compare it with the 
magnitude of the momentum of the electron. In Chapter 18 we showed that 
the kinetic energy of the electron was equal to the magnitude of the total 
energy (see Eqs. 18.3 and 18.4). The latter is of course negative and equal to 
—13.6 eV; hence FE, = 13.6 eV = 13.6eV Xx 1.6 x 10” J/eV = 2.18 x 107 aa 
The corresponding momentum is 


(2m cE) = (2 x 9.1 x 107°! ke x 218 x 10718 J)! 


P 


2 x 10°*4 kg m/sec 
Then the ratio of the uncertainty of the momentum Ap, to the momentum 
itself is 


Ap, __—«10 -** kg m/sec 
p 2x 10~-%4kg m/sec 


= 0.5 or 50% uncertainty 


This simple example shows that the classical concept of determinism cannot 
be carried over to atomic phenomena. Note that if we try to specify the location 
of the electron more precisely than the total orbit, that is, make Ax smaller, 
the uncertainty in the momentum becomes even greater than 50%. With such 
a limitation on simultaneous knowledge of position and momentum, we must 
develop and use the concept of the probability of location and momentum. 
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19.5 PHYSICAL ORIGIN OF 
THE UNCERTAINTY PRINCIPLE 


In the next section we will show that the uncertainty principle is a direct 
consequence of de Broglie’s hypothesis. Here we present a ““gedanken”’ (thought) 
experiment proposed by Bohr, which illustrates the physical origin of the 
principle, namely, the measuring process itself introduces the uncertainty. 
We can determine very accurately the position of a planet without significantly 
altering its momentum. The same is not true in the submicroscopic world of 
atoms and molecules. 

Suppose that we want to determine the position of an electron and that 
we can look at the electron with a hypothetical microscope. To see it, we must 
shine light on the electron; it is the scattered photons that the eye really sees. 
Any photon scattered by the electron within an angle equal to 2¢ (see Fig. 
19-3) will be focused by the microscope lenses and will be detected by the 
eye. It is clear that the collision of the photon with the electron will change 
the momentum of the electron. We can minimize this disturbance and the 
resulting uncertainty in the momentum of the electron by reducing the in- 
tensity of the light. The smallest intensity we can use, according to the quan- 
tum mechanical model of light, is one photon. This photon may enter the 
objective lens anywhere within the angular range +¢ to —@. This implies 
that the x-component of the momentum of this photon could have any value 
between —p sin ¢ and p sin ¢ (where p is the magnitude of the momentum 
of the photon). We conclude, therefore, that the uncertainty of the x-com- 
ponent of the momentum of the photon is 


Ap, (photon) = 2ppnoton sin 


Conservation of linear momentum necessitates that if the photon acquires a 
certain momentum in the x direction, the electron must acquire the same 
amount in the opposite direction. This means that the uncertainty in the x- 
momentum of the electron also has the same magnitude, that is, 


Ap, (electron) = 2ppnoton Sin h 


Because the momentum of the photon is given by p = h/A (see Eq. 17.18) we 
get 


h 
Ap, (electron) = 2 : sin d (19.6) 


Thus, we see that in the process of locating the electron, we have introduced 
an uncertainty in its momentum. 

We could reduce the uncertainty in p, of the electron in two ways. We 
can use photons of longer wavelength: Instead of using visible light, we could 
use microwaves or radio waves and a detector that is sensitive to them. 
Simultaneously, we could reduce @¢, the angle subtended by the lens: We can 
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FIGURE 19-3 


“Looking” at an electron with a 
hypothetical microscope in Bohr’s 
gedanken experiment. The elec- 
tron is illuminated with light 
(photons) The photons scattered 
by the electron that enter the ob- 
jective lens of the microscope are 
detected by the eye of the ob- 
server. 
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make the aperture of the lens small. Unfortunately, these two factors that 
would reduce the uncertainty in p, lead to an increase in the uncertainty of 
the position of the electron that we are trying to locate. In Chapter 12, Section 
12.6, we showed that owing to diffraction effects, our ability to resolve two 
separate sources of light decreases as the wavelength of the light emitted by 
the sources increases and as the size of the opening used to “look” at the 
sources decreases, Eq. 12.10. Thus, in the experiment under consideration, 
longer wavelengths and smaller lens aperture will increase the uncertainty 
about the origin of the photon entering the microscope, that is, about the 
position of the scattering electron. 


19.6 MATTER WAVES AND THE UNCERTAINTY PRINCIPLE 


We have seen that the spacial propagation of a particle is governed by a wave 
u(x,t). Let us try to find the specific mathematical form of the wavefunction 
u(x,t). We can start by looking at a very common type of wave: the sinusoidal 
traveling wave that we discussed in Chapter 11, Eq. 11.5’ 


u(x,t) = A sin (kx — at) (11.5’) 


We recall that this particular wave has the following properties: 


1. The amplitude A is the same at all points in space. 

2. It has a well-defined wavelength, A = 2a/k (Eq. 11.12). 
3. It has a well-defined frequency, v = w/27 (Eq. 11.14). 
4. It travels toward increasing values of x with a velocity 


=Av=5 11.2 
v= Aves (11.2) 
Can we associate this wave with a free particle? If the particle has a well- 
defined momentum and energy, the wave of Eq. 11.5’ satisfies the require- 
ments set forth by de Broglie, namely, a well-defined A and v. We should 
note, however, that because the amplitude of the wave is the same for all 
values of x, the particle can be found with equal probability at any point in 
space, that is, the particle is completely unlocalized, Ax = ~. Actually, we 
should not be surprised by this result. A well-defined A implies a well-defined 
momentum for the particle, that is, Ap, = 0. This in turn, according to the 
uncertainty principle, should lead to an infinite uncertainty in the coordinate 
of the particle. Thus, although the wavefunction of Eq. 11.5’ can be used to 
describe a particle of well-defined momentum and energy, the price paid is 
a complete lack of localization. 

Before we consider the type of wave to be used to describe a partially 
localized particle, we will show that the velocity of the wave given by Eq. 


28° 
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11.5’ is not the same as the velocity of the particle that it guides. The velocity 
of this wave is given by Eq. 11.2 


v= Av 


De Broglie’s relations are A = h/p and v = E/h, where p is the momentum of 
the particle and is equal to MUpartice, and E is the energy of the particle 
3 MVparticle. Substituting for v and A into the expression for v, we obtain 


1 2 
= E = E _ 2 MUparticle | 
CO a i ee = 2 Uparticle 
P Pp MU particle 


The velocity of the wave is half the velocity of the particle. In this case, because 
the amplitude of the wave is the same everywhere in space, it is not a serious 
problem. However, this condition would not be acceptable in the case of a 
partially localized particle, which we discuss next. 

To describe a particle that is partially localized, we need a wave with an 
amplitude that is different from zero only over a small region of space where 
there is a chance of finding the particle. We need a wave that looks like the 
wave packet shown in Fig. 19-4. The more localized the particle is, the narrower 
the wave packet must be. Mathematically, such a wave packet is obtained by 
mixing (adding) together an infinitely large number of sinusoidal traveling 
waves of the type described by Eq. 11.5’. Each of these waves differs in 
wavelength and frequency by an infinitesimally small amount from the pre- 
vious one. The amount of a given wave included in the mixing is determined 
by the amplitude A of that particular wave, that is, the amplitude becomes 
a function of the wavelength and of the frequency, A(A,v). Because the sum 
is carried over waves of infinitesimally different A’s and v’s, it can be written 
as an integral with respect to A and v. Alternatively, we can express the wave 
packet as an integral with respect to k and w because these are related to A 
and v, respectively, by Eqs. 11.12 and 11.14. The wave packet can be written 
as 


wix,t) = i “ I ” A(k,c) sin (kx — wt) dk do (19.7) 


For a given k(A) and w(v), the amplitude A has a definite value. However, A 
can be different for different k’s and w’s. The coefficient A basically determines 
how much of a particular wavelength and frequency we mix with the others. 
These coefficients determine the shape of the wave packet. Because of math- 
ematical complexity, we will not go over the details involved in the calculation 
of the integral, Eq. 19.7. However, as an illustration, Fig. 19-5 shows the 
spacial variation of w at t = 0 when we mix various amounts of sinusoidal 
waves of different wavelength. The graphs to the left show the form of A 
(that is, the amount of mixing), and the graphs to the right, the resulting 
wave packet w. 

The salient feature of these graphs is fairly obvious: The greater the range 
of k’s (and therefore A’s) that we mix, the narrower is the width of the resulting 


| 


FIGURE 19-4 

A wave packet may be used to 
describe a partially localized 
particle. The particle is located in 
the region of space where the 
amplitude of the wave packet is 
not zero. 
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FIGURE 19-5 
Wave packets obtained by mix- 
ing (adding) sinusoidal traveling 
waves of different wave vector k 
(and therefore, different wave- 
length). The graphs to the left 
show the amplitude (the amount 
of mixing) of the sinusoidal waves 
being mixed as a function of k, 
A(k) uh the graphs to the right show the 
, resulting wave packet. The sali- 
| ent feature of these graphs is ob- 
vious: The greater the range of 
k’s being mixed, the narrower the 
———— — —_—————»x resulting wave packet. 


wave packet. The physical significance of this mathematical result is clear, 
and we can see the uncertainty principle coming through. When w contains 
only one k, the wave has a well-defined wavelength and consequently the 
particle with which this wave is associated has a well-defined momentum. 
The amplitude of the wave is the same for all points in space: The particle is 
completely unlocalized. To localize the particle, we must mix waves with a 
range of k’s (and correspondingly a range of wavelengths). Because the wave 
in this case does not have a well-defined wavelength, the particle with which 
this wave is associated will not have a well-defined momentum. The wider 
the range of wavelengths that we mix, the wider the range of values that the 
momentum of the particle can take. But, at the same time, the narrower the 
wave packet will be and, consequently, the better localized the particle 
will be. 
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19.7 VELOCITY OF THE WAVE PACKET: 
GROUP VELOCITY 


We have seen that to describe a localized particle we must use a wave packet. 
Because the wave packet accompanies the particle and tells us approximately 
where the particle may be found, it must travel with the same velocity as the 
particle. 

Rather than looking at the wave packet of Eq. 19.7, let us consider a 
simpler case that is mathematically easier to handle. Let us mix two traveling 
waves that differ slightly in wavelength and frequency, 


Ww = Asin [kx — at] 
W = Asin [(k + Ak)x — (mw + Ao)t] 
where 
Ak<<k and Aw<<w 
The resulting y(x,t) will be 
W(x,t) = Wh + tp 
Wx,t) = Asin [kx — wt] + Asin [(k + Ak)x — (w + Ao)t] 


If we use the trigonometric relation 


sin A + sin B = 2 cos 
we get, 


u(x,t) = 2A cos =) sin (kx — ot) (19.8) 


where we have used the approximations 2k + Ak ~ 2k, 20 + Aw ~ 2a. The 
resulting wave is thus the product of two traveling waves. The second term 
of Eq. 19.8 represents a wave having roughly the same frequency and wave- 
length as the original waves. The first term of Eq. 19.8 represents a wave 
having a much larger wavelength and much smaller frequency. (Because 
A = 2n/k, if k is very small the corresponding A will be very large. Thus, in 
Eq. 19.8 the A represented by Ak must be much larger than the A for k. The 
opposite is true for the frequency. Because w = 27v, a small Aw has a smaller 
frequency v associated with it than does a large w). We can consider w as a 
wave similar to the original ones except that its amplitude is modulated by 
the first term, giving rise to a periodically varying amplitude. A ‘snapshot’ 
of # is shown in Fig. 19-6. Here we do not have a single wave packet as in 
the cases shown in Fig. 19-5, but rather a series of packets. Such a w could 
be used to describe a beam of particles, with one particle in each wave packet. 
The velocity of the wave inside the envelopes is the same as the velocity of 
the individual waves, v = w/k, Eq. 11.2. By analogy to this, and on inspection 
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of Eq. 19.8, the envelopes (that is, the wave packets) travel with a velocity, 
called the group velocity, Vgroup, given by 


oa 
2 Aw dow 
Ugroup Ak Ak dk ( 9 9) 
2 


We can show that v,,oup is the same as the velocity of the particle, by 
using de Broglie’s relations, 


_h _ 20 _p _ dp 
A = or because A = L! k = Fs hence dk = Fs 
and 
E dE 
y= or because v = =~, o=s% hence do = —— 
Substituting for dk and dw into the expression for Vgroup, Eq. 19.9, we get 
dE 
Ugroup = ff (19.10) 


But we know that E = } mUeartiae = p?/2m; it follows that 


= dE _ Te _ MUparticle = y 

dp _ m m particle 
The wave packet moves with the particle. Although we have derived this 
result for a particularly simple case, it holds for the more general case de- 


scribed by Eq. 19.7. 


19.8 THE PRINCIPLE OF COMPLEMENTARITY 


When we summarize our knowledge of nature, we see that energy can be 
carried two ways: wave-like, such as in the case of the ripples in a pond, or 
particle-like, such as in the transfer of energy from a gun to the target by the 
bullets. To explain these diverse phenomena, we construct two distinct math- 


FIGURE 19-6 

A “snapshot” of a wave with a 
periodically varying amplitude. 
Such a wave is obtained by add- 
ing two sinusoidal traveling 
waves of slightly different fre- 
quency and wavelength. 
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ematical models: a wave model and a particle model, each satisfying certain 
laws—Newton’s laws of motion for particles and the superposition principle 
for waves. 

As scientists, we try to extend the models into regions that are visually 
less accessible. We explain the behavior of sound using the wave model, and 
we do so successfully. Similarly, we use the particle model to explain the 
interaction of the molecules of a gas with the walls of the container, and again, 
we are successful. 

These successes condition us to expect all entities to behave either as the 
particle model predicts or as the wave model does. Unfortunately, when we 
deal with electromagnetic radiation or with elementary particles such as elec- 
trons, protons, or neutrons, neither model is able to account for their complete 
behavior. We need a more complex model, one that encompasses aspects of 
both the particle and the wave models. This dual character of matter is sum- 
marized in Bohr’s principle of complementarity: The particle and the wave models 
are complementary. Certain measurements reveal the wave aspects of electro- 
magnetic radiation and of material particles; measurements that involve their 
spacial distribution. Other measurements reveal their particle aspects; mea- 
surements dealing with the interaction with one another or with other entities. 

Because the predictions of the wave model concerning either the prop- 
agation through space or interactions are incompatible with those of the par- 
ticle model, no measurement can simultaneously reveal the particle and the 
wave properties of matter. In a given situation, electrons are either particles 
or waves, not both. However, to understand their overall behavior, both 
models are needed and, as we have shown, both models can be made math- 
ematically equivalent through the connecting link of de Broglie’s hypothesis. 


The allowed orbits for the elec- 


tron in the hydrogen atom (as 
proposed by Bohr) are those in 
which the de Broglie wavelength 
of the electron for that orbit fits 
an integral number of times (see 
problem 19.15). 


PROBLEMS 


19.1 The wavelength of one of the yellow emission 
lines of sodium is 5890 A. What is the kinetic energy 
of an electron having the same de Broglie wave- 
length? 


19.2 Thede Broglie wavelength ofa protonis 107m. 


(a) What is the speed of the proton? (b) Through 
what potential difference must the proton be accel- 
erated to acquire such a speed? 


19.3. Through what potential difference must an 
electron be accelerated for its de Broglie wavelength 
to be equal to the smallest wavelength of the X rays 
produced by an X-ray tube operating at 50,000 V? 
(Answer: 2440 V.) 


19.4 For quick calculations of the wavelength of 
electrons accelerated through a potential difference 
V, physicists use the formula A(A) = V150/V. 
Derive this formula, and find the percentage inac- 
curacy of the equation. 

(Answer: 0.32%.) 


19.5. The limit of resolution of an object is of the 
same order of magnitude as the wavelength used to 
“see” the object. An electron microscope operates 
at 60,000 V. What is the approximate size of the 
smallest object that can be seen with such a micro- 
scope? 


19.6 An electron, a neutron, and a photon have 


the same wavelength, A = 1 A. (a) Calculate and 

compare the frequency of each. (b) Calculate the 
energy of each. 

(Answer: (a) 3.64 x 10° Hz (electron), 

1.98 x 10'S Hz (neutron), 3.00 x 10’°Hz (photon), 

(b) 151 eV (electron), 8.2 x 1077 eV (neutron), 

1.24 x 10* eV (photon).) 


19.7 An a particle is emitted from a nucleus with 
an energy of 5 MeV (5 x 10° eV). Calculate the 
wavelength of an a particle with such energy and 
compare it with the size of the emitting nucleus that 
has a radius of 8 x 107 m. 

(Answer: 6.4 x 107° m.) 


19.8 What is the energy of an electron with a wave- 
length equal to the nuclear diameter, about 10°“ 
m? 


19.9 A beam of electrons having a range of ve- 
locities enters a region of space with an electric field 
perpendicular to a magnetic field and both fields 
perpendicular to the direction of the beam (see Fig. 
19-7). What is the de Broglie wavelength of the 
emerging electrons? The symbol : next to B in Fig. 
19-7 indicates that the direction of the magnetic field 
is out of the page. 


hB 
A i . 
(Answer ne ) 


oe 
FIGURE 19-7 
Problem 19.9. 


19.10 A neutron of wavelength 3 A undergoes a 
first-order Bragg reflection from a calcite crystal with 
lattice spacing 3.036 A. Calculate the reflection an- 
gle. 


19.11 In neutron spectroscopy a beam of mono- 
energetic neutrons is obtained by reflecting reactor 
neutrons from a beryllium crystal. If the separation 
between the atomic planes of the beryllium crystal 
is 0.732 A, what is the angle between the incident 
neutron beam and the atomic planes that will yield 
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a monochromatic beam of neutrons of wavelength 
0.1 A? 
(Answer: 3.9°.) 


19.12 In the Davisson and Germer experiment de- 

scribed in Section 19.2, at what angles would the 

second- and third-order diffraction maxima occur? 
(Answer: They cannot be observed.) 


19.13 A beam of a particles (g = + 2 e) is accel- 
erated through a potential difference of 10 V. It then 
strikes a NaCl crystal (d = 2.82 A). What is the 
highest order Bragg reflection that can be observed? 


19.14 Show that radius of the nth electron orbit in 
the Bohr atom is equal to n/27 times the de Broglie 
wavelength of the electron in that orbit. 


19.15 Suppose we wish to develop the Bohr model 
from the de Broglie hypothesis by assuming that the 
allowed orbits are those that contain an integral 
number of de Broglie wavelengths. Show that this 
condition leads to Bohr’s first postulate, Eq. 18.10, 
mor = nh. 


19.16 A small particle of mass 10~° g moves along 
the x axis; its speed is uncertain by 10~° m/sec. (a) 
What is the uncertainty in the x coordinate of the 
particle? (b) Repeat the calculation for an electron 
assuming that the uncertainty in its velocity is also 
10~° m/sec. 

(Answer: (a) 1.05 x 10~!? m, (b) 115 m.) 


19.17 What is the uncertainty in the position of an 
electron that has been accelerated through a poten- 
tial difference V = 1000 + 0.1 V. (Hint: Express the 
momentum p of the electron in terms of V; differ- 
entiate p (V) with respect to V to find Ap in terms 
of AV). 


19.18 In an experiment the position of an electron 
is determined with an accuracy of + 107’ m. What 
is the uncertainty in the position of the electron 107° 
sec later? 

(Answer: 0.6 < 1073 m.) 


19.19 The uncertainty in the position of a particle 
is equal to the de Broglie wavelength of the particle. 
Calculate the uncertainty in the velocity of the par- 
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ticle in terms of the velocity of the de Broglie wave 
associated with the particle. 
(Answer: Uwavel 7.) 


19.20 A beam of monochromatic electrons is sent 
through a double slit. The width of the slits is smaller 
than the de Broglie wavelength of the electrons. The 
separation between the slits d is greater than but of 
the same order of magnitude as A. A detector is 
placed to the right of the slits at an angle 6 with 
respect to the incident direction of the electrons (see 
Fig. 19-8). When slit 1 is closed, N2 electrons reach 
the detector every second. When slit 2 is closed, N, 
electrons reach the detector every second. Find the 
number of electrons reaching the detector per sec- 
ond when both slits are open if @ is such that 
d sin 0 = X, where J is the de Broglie wavelength 
of the electrons. 

(Answer: N; + Nz + 2 (N,N>)?”) 
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FIGURE 19-8 Detec 
Problem 19.20. 


19.21 In elementary statistics it is shown that the 
standard deviation o, associated with a quantity x 
is equal to 


where x2 is the average of the squares of x, that is, 


n 
Dome 
i=1 

nN 


_and X* is the square of the average of x, that is, 


n 


>, xi\2 


—2 i=1 
Xx 
nN 


The standard deviation is often taken as a measure 
of the uncertainty in the quantity involved. (a) Use 
this to estimate the minimum energy of an electron 
confined in an atom of radius 107 ?° m. (b) How does 
the answer to part (a) compare with the kinetic en- 
ergy of the electron in the hydrogen atom? (c) Repeat 
the calculation of part (a) for an electron confined to 
a nucleus of radius 107 * m. (d) The binding energy 
of the nuclear constituents is a few MeV (10° eV). 
What conclusion can we draw about the presence 
of electrons in the nucleus? 

(Answer: (a) 11 eV, (c) 1.1 xX 10? eV.) 


19.22 Use the uncertainty principle to show that 
the minimum énergy of a particle undergoing simple 
harmonic motion in one dimension is of the order 
of hv, where h is Planck’s constant and v is the fre- 
quency of the motion. Take the standard deviation 
presented in Problem 19.21 as the uncertainty in the 
position and in the momentum of the particle, that 
is, 


and _ 
(Ap,)* a Cox = pr _ Dx 


Note that in this case xX° = 0 and px = 0. 


CHAPTER 20 


An Introduction to 
the Methods of 


Quantum Mechanics ° 
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20.1 INTRODUCTION 


So far, we have discussed some of the experimental evidence that led to the 
breakdown of classical physics and to the beginning of quantum mechanics. 
We have seen how the introduction of the quantization postulates explained 
the experimental facts concerning blackbody radiation, the photoelectric ef- 
fect, and the hydrogen spectrum. These theories constitute what we call today 
the old quantum theory (OQT). 

Despite its successes, the OQT has some serious deficiencies: 


1. The theory can be applied only to periodic systems (harmonic oscillators, 
circular motion, and such), although there are many important physical 
systems that are not periodic. 


2. Although the Bohr theory predicts the observed wavelengths of the spec- 
trum of hydrogen, it does not explain why certain wavelengths are more 
intense than others; that is, it does not account for the rate of transition 
between different energy levels. 


3. The Bohr theory explains well the spectrum of monatomic hydrogen (H), 
singly ionized helium (He™), and reasonably well those of the alkali ele- 
ments (lithium Li, sodium Na, potassium K, . . .), but only because these 
are H-like atoms (as will be shown in Chapter 21). It fails to explain the 
spectrum of even the simplest of the multielectron atoms, He. 


4. But perhaps the most serious criticism of the OQT is that it is intellectually 
unsatisfying; it is not a unified or a general theory. It assigns to microscopic 
particles a well-defined path that, by the uncertainty principle, is not pos- 
sible. | | 


20.2 THE SCHRODINGER THEORY 
OF QUANTUM MECHANICS 


20.2a The Time-dependent Schrodinger Equation 


The basis of the modern theory of quantum mechanics was developed in 1925 
by Erwin Schrédinger (1887-1961); an equivalent, but mathematically differ- 
ent, theory was presented just about the same time by Werner Heisenberg. 
In this book we will deal only with the Schrédinger theory. 

The most important fact that we have presented so far is that the behavior 
of a microscopic particle is governed by the wave associated with it. In trying 
to find the wave associated with the particle, de Broglie’s postulates give us 
the first guideline. We have seen that if a particle has a well-defined mo- 
mentum and energy, we can use a sinusoidal traveling wave, that is, either 


uw = Asin (kx — wt) or wW = Acos (kx — oat) 


or a linear combination of both. As we have seen, if we want to describe a 
free particle, which is partially localized, we could use a wave packet. 


Erwin Schrodinger (1887-1961). 
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De Broglie’s hypothesis does not tell us what type of wave one can associate 
with a particle that is not free and that is acted on by a force. If a particle is 
acted on by a force, its momentum and its energy will not be constant. 
Therefore, it is meaningless to talk about a A and a v, because these quantities 
are changing. The Schrédinger theory tells us how to obtain the wavefunction 
u(x,t) associated with a particle, when we specify the forces acting on the 
particle, by giving the potential energy associated with the forces. (In quantum 
mechanics the potential energy is often referred to simply as the potential). 
The Schrédinger theory also tells us how to extract information about the 
particle from the associated wavefunction. 

Schrédinger developed a differential equation whose solutions yield the 
possible wavefunctions that can be associated with a particle in a given phys- 
ical situation. This equation, known as the Schrédinger equation, tells us how 
the wavefunction changes as a result of the forces acting on the particle. 
Because the wavefunction w& is a function of space and time, the equation 
contains derivatives (remember that a derivative represents the rate of change) 
with respect to x, y, and z and with respect to ft. In this chapter we will 
primarily consider motion in only the x direction. In mathematics, when a 
function depends on more than one independent variable, the derivative of 
the function with respect to one of them, while treating the other variables 
as constants, is called the partial derivative with respect to that variable. The 
partial derivative of w with respect to x is written as dy//dx, with similar notation 
for the partial derivative with respect to t. Several plausibility arguments can 
be used to arrive at the Schrédinger equation. We present here one that is 
based on the idea, to be demonstrated later, that there is a relation between 
mathematical operators and physical quantities. (In mathematics an operator 
is represented by a symbol or group of symbols and indicates an operation 
to be performed. Thus, for example, when the operator — if 0/dx is placed in 
front of a function, it indicates that the function is to be differentiated with 
respect to x and then multiplied by —ih, where 7 is the imaginary number 

= 1) 

The total energy of a particle is equal to the kinetic energy plus the 

potential energy, 
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Multiplying both sides of this equation by the wavefunction & we obtain 
p? 
= ao E 20.1 
EW = + Ep (20.1) 
Later in this chapter (see Section 20.2c) we will see that there is a relation 


between the energy E and the operator if 0/dt and between the momentum 
p and the operator — ih d/dx. By this we mean that, operating on the function 
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with ih d/at and with —ih o/ax is the same as multiplying the function by E 
and p, respectively. Substituting these operators for E and p in Eq. 20.1, we 


have 
ab 1 _@ _ 9 
— = — (| -ih — —ih — + E 
at = ( m =) ( =) ‘ pl 


or, because 1 and f are constants, 


(20.2) a 


om ax? 


Equation 20.2 is known as the one-dimensional time-dependent Schrodinger equa- 
tion. If the potential energy E, is known, this equation can be solved in 
principle, and the solution will yield the possible wavefunctions that we can 
associate with the particle. The Schrédinger equation is to quantum mechanics 
what Newton’s second law is to classical physics. 

One sometimes finds “proofs” of the Schrédinger equation. These so- 
called proofs are simply plausibility arguments. Somewhere along the argu- 
ment, at a critical point, the argument becomes a postulate. For example, in 
the argument we just presented to arrive at Eq. 20.2, we used the fact that 
there is a relation between E and ih o/ot and between p and — ih d/dx. We will 
prove this for the free particle case. When the particle is not free, the rela- 
tionship cannot be proved, but we postulate that it still holds and experiment 
bears this out. The Schr6dinger equation cannot be derived from first prin- 
ciples; it is a first principle, which cannot be mathematically derived, just as 
Newton’s laws of motion are not derivable. The justification lies in the fact that 
its predictions agree with the experiment. 


20.2b The Schrédinger Equation for a Free Particle 


Let us consider a free particle moving along the x axis with definite momentum 
p = mv and definite energy E = 1/2 mv’. If no force acts on the particle, that 
is, F = 0, the potential energy is E, = constant, which we can choose to be 0. 
Thus, the condition F = 0 requires that E = constant. The Schrodinger 
equation (Eq. 20.2) in this case may be written for E, = 0 as 

h> arp — , ob 

ee SS 20. 

2m ax2 | at a) 
As mentioned at the beginning of Section 20.2a, because we are dealing with 
a particle of well-defined momentum and energy, we might expect that the 
solution would be in the form of a traveling wave, that is, either 


w = Asin (kx — wt) or Ww = Acos (kx — at) 


. ou 
+ Eh = th = 
at 
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or some linear combination of these two functions. If one tries either of these 
by substitution into Eq. 20.3, one finds that neither satisfies the Schrodinger 
equation. The reason is that when you differentiate a sine function twice with 
respect to x, you get the sine function back, but when you differentiate it 
with respect to time once, you get a cosine function. For example, let us 
consider # = A sin (kx — ot). Differentiating with respect to x, we first get 


us 


) ; = 
ae [A sin (kx — wt)] = kA cos (kx — at) 


and the second differentiation yields 


ary 


ax? 


= = kA cos (kx — wt)] = ~RA sin (kx — ot) 


The derivative of ~# with respect to tf yields 


aw oa Z 
hen [A sin (kx — wt)] = —wA cos (kx — at) 


Substituting these results for d7/dx* and dy/dt in Eq. 20.3, we obtain 


k*h? 

— A sin (kx — wt) = —thw A cos (kx — ot) 

2m 
Because the sine and cosine functions are equal only for certain angles (for 
example, 45°), this cannot be satisfied for all x’s and ?t’s. Similar results are 
obtained with the cosine function. There is, however, a particular combination 
of these two functions that does satisfy the Schrodinger equation. This com- 
bination is 


w = A [cos (kx — wt) + 1 sin (kx — at)] 
which, by mathematical definition, may be written as 
b= A elo!) (20.4) 


We may show that Eq. 20.4 satisfies the Schrédinger equation by substitution 
into Eq. 20.3. Differentiate Eq. 20.4 twice with respect to x 


= po < [A geo) aa ik A ellkx — wt) 
x x 
2 
“e — - [ik A aiaalcie)| = (ik)? A ellkx — ot) = — k2 A ellkx — et) 
x x 


The derivative with respect to f yields 


oy = 9 [A el -o)] = iw A eile ot) 


ot ot 
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and, substituting into Eq. 20.3, we get 


~ A eilkx — wt) = hw A ellkx — wt) 
mM 
or 
h7k 
re hw (20.5) 


which means that Eq. 20.4 is a solution for this relation of the constants. We 
must now show that Eq. 20.5 is consistent with de Broglie’s hypothesis. The 
kinetic energy of a particle is given by E = 1/2 mv? = p?/2m. From de Broglie’s 
hypothesis, 


E=hy=h— = ho 
21 
h h 
p= 2a 
k 
p? 
Substituting these relations into the expression for the energy, E = a’ WE 
get me 
h7k? 
hw = — 
8 Om 


which is the same as Eq. 20.5, and the consistency of the formulation has 
been demonstrated. That is, for a free particle of definite momentum and 
energy, the wavefunction y% that satisfies the Schrodinger equation and 
de Broglie’s postulate is given by Eq. 20.4. The fact that Eq. 20.4 is not a real 
function (that is, it has an imaginary component) is not a problem. The wave- 
function itself has no physical meaning. We have seen (see Eq. 19.3) that it 
is ||? that has physical significance. When i is a complex function we write 
by mathematical definition that 


WP = ory 
where y* is the complex conjugate of w. (The complex conjugate of a function 
ws is a function whose real part is the same as the real part of # and whose 


imaginary part is the negative of the imaginary part of w.) In the preceding 
case, 


| wl? = wt = Are ilkx—ot) A pilkx—ot) 
= A*A 


This result is a real quantity, which it must be by Eq. 19.3, because an ima- 
ginary probability is not defined mathematically. In this equation the prob- 
ability of finding the particle at any point in space is given as |2dV, and 


| uh = hob 
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therefore |¥|?, when properly normalized, (Eq. 19.4), may be considered a 
probability density that must be both real and positive. 


20.2c Expectation Values 


The preceding solution of Eq. 20.2 was for the case of E, = 0. The method, 
however, is valid for all functional forms of E,, although the solution of (x, t) 
is more complicated. The following discussion applies to all solutions of Eq. 
20.2, regardless of the form of E,. 

One of the consequences of the uncertainty principle is that the position, 
momentum, energy, and so forth of a particle cannot, in general, be precisely 
determined. We can, however, talk about the average value of these quan- 
tities. Average values in quantum mechanics are called expectation values. The 
physical significance of the expectation value is the following. Suppose we 
take a large number of particles, all of them described by the same W(x, t). If 
we measure the value of a particular dynamical quantity, for example, the 
momentum p, we will find that, in general, p is different for different particles 
even though all are described by the same yx, ft). We can, however, calculate 
the average value of p. If we repeat our experiment with another system of 
a large number of particles each described by the same u(x, t), we will find 
that the average value of p will be the same as in the first experiment. Ex- 
pectation values do have physical significance as a statistical average even in 
cases where the exact value of the individual dynamical quantity is not well 
defined. 

The question that we may ask is: Can we determine these expectation 
values without doing the experiment if we know the wavefunction y(x, t) 
associated with the system? The answer is yes. 

The average position of a system of particles is usually obtained by the 
standard statistical method of adding the positions of all the particles and 
dividing this sum by the total number of particles. If we have Nrota particles 
distributed along the x axis in such a way that there are N; particles at x,, N>2 
at X2, N3 at x3, .. ., then the average particle position x will be given by 


XiN, = XoN> a x3N3 fee 


x= 
Nrotal 


> xiN; 


This expression is analogous to the one used in Chapter 6 (Eq. 6.2) for the 
center of mass of a system of particles. The only difference is that the weight- 
ing factor here is the number of particles at each location rather than the 
mass. 

Suppose that instead of a number of particles we have a single particle 
and that it has different probabilities of being found at different locations, for 
example, probability P,; of being found at position x,, P2 at x2,.... Also, 
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because the particle must be somewhere, we know that the sum of all the 
probabilities must be unity, that is, 

Pi, +P, + Pg ts+'=1 


This statement about the sum of probabilities bears the term that they are 
normalized. The expression for the average or mean particle position can now 
be written as 


X1P, + XP. + x3P3 + °°> 


x 


Or 


S xP; 


When the intervals Ax = x; —x;~1 approach zero—that is, the position that 
the particle may be at varies continuously—the preceding summation be- 
comes an integral and 


xX 


X = ia xP(x) dx 


a 


As in the case of the discrete probabilities, P,’s, the continuous sum of prob- 
abilities must be normalized, that is, 


[. P(x) dx = 1 


If the particle’s position varies with time, then the probability of finding the 
particle between x and x+dx is a function of time and is written as P(x,t) dx. 
The average value of x will be 


[ . xP(x,t) dx 


¥ = 
[ P(x,t) dx 


where the denominator has been included to ensure that the P’s are properly 
normalized. 

In Chapter 19, Eq. 19.3, we saw that the probability P(r,t) dV that a particle 
be found within a small volume dV centered around a point with position 
vector r was |f|"dV. In the special case under consideration, where the particle 
is restricted to the x axis, the probability P(x,t) dx of finding the particle at 
time t between x and x + dx will be ||?dx. We mentioned earlier that when 
ws is a complex function, |y|? = ~*w, where ¥* is the complex conjugate of . 
Thus, 


| _ xy*w dx 


fT teas 
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where the denominator ensures that the probability is normalized. If we have 
wavefunctions w that we know are normalized we do not bother to write the 
denominator because we know that it equals unity. It is customary to write 
the numerator with the x between the two wavefunctions as 


a [ wx dx (20.6) _ 2 | | de* xu dx 


Equation 20.6 tells us that the average value of x is the sum of all the possible 
values of x weighted by the probability that the system may be found at each 
possible value. For this relation to be valid, # must be normalized so that the 
sum of their probabilities for all x is unity, that is, 


[. pry dx = 1 [ ley dx = | 


Not only can we get the average value of x from & by this method, but 
we can similarly find the average value of any dynamical quantity that can 
be written as a function of x, for example, the average value of the potential 
energy E,(x) is 


E, = [. WE (x) ys dx E,, = [ : Wi E(x) ax 


It should be clear why this expression is correct. If the particle is at a given 
x, it will have the potential energy E,(x) associated with that x. Thus *ydx 
is not only the probability that the particle be found between x and x + dx 
but also the probability that it will have a potential energy between E,(x) and 
E,(x + dx). The same applies to any other dynamical quantity that can be 
expressed in terms of x. 

There are quantities, such as the momentum and the total energy, that 
in classical physics can be expressed in terms of the coordinates of the par- 
ticles. In quantum mechanics, however, this is not permitted because it would 
violate the uncertainty principle. In Chapter 19, when we introduced the 
uncertainty principle, we indicated that it is not possible to determine exactly 
both the position and the momentum of a particle. Therefore, we cannot 
express the momentum as a function of x, that is, p(x). How can we get 
information about the momentum of the particle? Let us consider the free 
particle wavefunction that we found previously (see Eq. 20.4). This wave- 
function represents a particle with definite momentum p = hk and definite 
energy E = hw. Let us differentiate the wavefunction given in Eq. 20.4 with 
respect to x and then multiply both sides by —1th. 


oy 


os eee ik A ellkx — wt) 
Ox 
—ih oy = (-1h)ikA Bere) ea he Ae Oo 
Ox 
4) 
—ih sid = pw (20.7) 
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This result tells us that ai is an association between the dynamical quantity 
p and the operator — ia 2 — - The effect of multiplying the wavefunction w by 
ox 
0 
the momentum p is the same as operating on wW with the operator — ii — 
and the equivalence may be a as ax” 
7) 


=> —ih — 


Ox 


Let us now return to our method of calculating expectation values, Eq. 20.6. 
The method consists in first multiplying the wavefunction w by the quantity 
in question (x, E,(x), p, ...), we then multiply the result by the complex 
conjugate of the wavefunction * and integrate over all space. When we try 
this method for the momentum p, we run into trouble because we cannot 
express p in terms of x. However, as indicated by Eq. 20.7, instead of mul- 


. a) 
tiplying w by p, we could operate on w with — ih ae because the effect is the 


same. We obtain the expectation value of p for a particle moving along the x 
axis as follows 


So el oe 
p= [9 ( ih =) yp dx (20.8) 


Let us apply this to the free particle wavefunction, Eq. 20.4, 


- ) 
* a? i(kx — wt) 
[. wy ( ih =) Ae dx 


p = 

p= [ot (-ihik A eo) dx 
5 = hk [ wid dx 

5 = hk 


which is a form of de Broglie’s relation. In the last step, we made use of the 
normalization condition mentioned earlier; that is, the probability of finding 
the particle anywhere in space must be 1. 


[. W*wdx = 1 


This result is mathematically consistent; that is, the average or expectation 
value of something that has a definite value is that value itself. 

A similar method can be used to find the average value of the energy. 
Let us again consider the free particle wavefunction, Eq. 20.4, associated with 


‘Although the wavefunction = A e**~ is not normalizable, it can be normalized by making 
the limits of integration very large but not infinite. 


= a \ 
us* ( — ih - ] us ax 
3 ’ ox | 
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a particle of definite energy E = hw. 
ws =A eilkx — wt) 
differentiate both sides with respect to ft 


oe 
at 


_ —iwA ellkx — wt) 


multiply both sides by ih 


ih = = (ih)( — 1) A eilkx — wt) = hw A eilkx ~ wt) 
9 
ih w= Ey (20.9) 


We see that just as in the case of the momentum, there is an association 
between the energy of the particle and an operator, in this case the operator 


0 
is ih a The effect of multiplying the wavefunction ~ by the energy E is the 


; ; ee 
same as operating on & with the operator ih a and the equivalence may be 
expressed as 
EB eae 4 ih =. : d 


E <> ih — 
t fil ey 


Therefore, the average or expectation value of E is obtained by the same 
method of finding averages, namely 


E= [. w* E wh dx 


Tr = * : 0 | 
B= [uy (i 4 w dx (20.10) 


Let us apply this to our free particle wavefunction, Eq. 20.4. 


_ = 0 

— * i? il i(kx — wt) 
E [9 (i *) Ae dx 
E= ii yV (ih)(—iw) A el) dx 
E = ho [ w*w dx 
E = ho 


This result is mathematically consistent because the wavefunction represents 
a particle of definite energy E = fw and the average value of something that 
has a definite value is the value itself. 
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20.2d Time-Independent Schrodinger Equation 


The Schrodinger equation in one dimension is a partial differential equation 
that involves t and x as the independent variables. One standard technique 
for solving partial differential equations is the method of separation of variables. 
The method consists in trying a solution that is a product of two functions: 
one a function of x alone and the other a function of ¢ alone; that is, let 


W(x,t) = x(x) P(é) (20.11) Ux,t) = x(x) T(t) 


In the cases in which this method works, the partial differential equation is 
reduced to two ordinary differential equations, each one involving only one 
of the independent variables. In the case of the Schrédinger equation, Eq. 
20.2, the method works if the potential energy E,, is a function of x alone, 
that is, E,(x). 

In the remainder of this section we are going to show that Eq. 20.11 
satisfies the Schrédinger equation. Let us substitute it into Eq. 20.2, 


h,2 h* dry oy 
Om ax? + oe ae at 
= d? 
5 T() 2S + Eo) = ito) 


If we divide through by x(x)I'(#), we get 


1 dI(t) 
T(t) dt 


hf 1 dx(x) 
2m y(x) dx? 


+ E(x) = ih —— 


The right side of the equation is a function of ¢t alone, and the left side is a 
function of x alone. Because x and t are independent variables, the equation 
will be correct if both sides are equal to the same constant G which is called 
the separation constant, that is, 


h? 1 dy 
ee a ; 
Sere p(X) G (20.12) 
and 
. 1 av 
ih rp di = G (20.13) 


We now have two ordinary differential equations, one involving x alone and 
the other involving ¢t alone. We should point out that the equation for I(t), 
Eq. 20.13, does not involve E,(x). This means that the solution for I(t) is the 
same in all physical situations where E, does not depend on time. The dif- 
ferential equation for I(t) can be solved easily by simple integration, 

av G iG 


THe — 3 at 
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Integrating, we get 


_G 
In — 1 a t + constant 


T(t) = Ke-ist* (20.14) 


where K is the natural antilog of the integration constant. Because it is an 
arbitrary constant, we will set it equal to unity (1) and drop it from the 
equation. We may evaluate the constant G by the following method. Let us 
operate on the wavefunction (x,t) with the energy operator ih 9/dt 


0 a a, 
ih 7 U(x, t) = ih F y(x)P() 


ra) 
ih — y(x) e~ iG 
at x( ) 


= ih x(x) (-i i) eW iGeih 
G y(x)P(E) 


G w(x, t) (20.15) 


0 
th Fi U(x, t) 


Comparing Eq. 20.15 with Eq. 20.9, we see that the separation constant G is 
the total energy E of the system. Thus the time-dependent part of the wave- 
function (x, t) is, 


T(t) = e7 ie (20.16) r() = ee 


If we now wish to find what the space-dependent part of the wavefunction 
is, we have to solve the differential equation (20.12), where G is now E. 
A2 2 A> d*y a 
= 57 4 + E,(x)x = Ex (20.17) om a + E(x)x = Ex 

Equation 20.17 is called the time-independent Schrodinger equation. For a given 

E(x), the equation has to to be solved to find the possible y(x)’s that can be 

associated with the system. So far, there are no restrictions on the values of 

the total energy E of the system. Thus, for different E’s we expect to get 

different functions y(x). We will see, however, that x(x) must satisfy certain 

requirements that we discuss next. These requirements will, in some cases, 

restrict the number of physically acceptable x(x)’s and consequently the values 

of E that the system may have. In the Schrédinger theory, energy quantization 

is not introduced as an arbitrary postulate, as in the Bohr theory; it is a 

consequence of the fact that the wavefunction associated with the particle 

must be well-behaved, a term defined in the next section. The solutions y of 

Eq. 20.17 are called the eigenfunctions or eigenstates; the corresponding values 

of E are called the eigenvalues (in German, “eigen” means “proper’’). 
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20.2e Required Properties of the 
Eigenfunction ;(x) and its Derivative 


We just mentioned that x(x) must be well-behaved. This means that the eigen- 
function x(x) and its derivative dy/dx must have the following properties: 


1. They must be finite everywhere. Figure 20-1a shows a function f that be- 
comes infinite at x = xo. Such a behavior for y or dy/dx is not acceptable. 


2. They must be single-valued everywhere. In Figure 20-1b we show a function 
f that is not single-valued everywhere. A y or dx/dx exhibiting such behavior 
would not be acceptable. 


3. They must be continuous everywhere. Figure 20-Ic illustrates a function f 
with a discontinuity at x = xo. Again, if y or dy/dx behaved in such a 
manner, that eigenfunction would not be acceptable. (In some special cases 
dx/dx is not continuous.) 


When these conditions are satisfied, the eigenfunction and the associated 
wavefunction (remember that (x, t) = x(x)I(f)) are said to be well-behaved. 
Why must these conditions be satisfied? 

Recall that (see Eqs. 20.6 and 20.8) 


and 


If y or dy/dx are not finite, the average value of the momentum will be infinite; 
infinite momentum means infinite energy; this is not physically possible. 
Similarly, if y or dy/dx were not single-valued, the average position and the 
average momentum would not be defined. The exact value of x and p may 


FIGURE 20-1 

Examples of functions that are not well-behaved. (a) A function that becomes infinite 
at x = X,. (b) A function that is not single-valued everywhere. (c) A function with a 
discontinuity at x = Xp. 
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be uncertain, but the averages have to be defined. The reason is that if we 
make a set of measurements on a very large number of identical systems, 
that is, systems described by the same (x, t), we will get a consistent, well- 
defined value for the average x and the average p. The calculation of these 
averages must reflect the experimental fact, that is, (x) and dy/dx must be 
single-valued. 

The requirement of continuity is closely tied to the requirement of fi- 
niteness. If y is not continuous at a given point in space, then dy/dx would 
be infinite at that point. This, as we have indicated, is not allowed. If dx/dx 
is discontinuous at some point, then d*y/dx* would be infinite at that point. 
From the time-independent Schrédinger equation, Eq. 20.17, this would imply 
that either E or E, is infinite, which is not physically possible. 

In the case of bound systems (one in which the particle is restricted to 
move over a finite region of space), these requirements on y lead to the 
quantization of the energy and of other physical quantities of the system. 
This fact will become clear when we consider some simple cases. 


20.3 APPLICATION OF THE SCHRODINGER THEORY 
20.3a Particle Inside an Infinite Potential Well 


Let us consider a particle such as an electron confined inside a one-dimen- 
sional well of length a with infinitely high walls (see Fig. 20-2). 


ofor0>x>a 
E,(x) = paeucas 


Let us write E, (outside) as E,,. Since E,. is ©, one must provide an infinite 
amount of energy to pull the particle out of the well. This means that the 
particle cannot get out. This, obviously, is a highly idealized model. The 
reason for first considering the idealized model is that it is mathematically 
much easier to solve than are physically realistic ones. However, the main 
features of both idealized and realistic models are not really very different 
because E,,. can be very large while remaining finite. We will see when we 
apply the model to the electrons in a solid that the predictions based on the 
present approximation of E,, = ~ agree quite well with the experimental 
results. 

Our task is to find all the possible wavefunctions that can be associated 
with the particle inside such a well. Before we do this, let us answer the 
question: What is the eigenfunction y outside? The answer is xy = 0. The 
reason is that for the particle to get out, an infinite amount of energy must 
be provided. This is physically impossible, and therefore |x|? = 0 outside the 
well, and we may state that 


y = O0Ofor0=x=a 


FIGURE 20-2 
An infinite potential well. 
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To evaluate y inside, we must solve Eq. 20.17 for the case when E, = 0, 
og ey (20.18) 


Divide both sides by — h?/2m and put both terms on the left side to get 


d*y 2mE 
de 7 pe X=” 
or 
“x 
oad ae (20.19) 
where 
2mE 
2 = 7 (20.20) 


Note that k in Eq. 20.20 is the wave vector k = 27/A of Eq. 11.12. This can 
be readily shown. Because in this case the total energy E is simply the kinetic 
energy, 


_! oF 
ea a 
From de Broglie’s hypothesis, Eq. 19.1, 
h h 
=—-— = — = hk 
P A Qa 
k 
Therefore 
2 2]-2 
ee ans 
2m 2m 
and 
2mE 
ke = 52 


which is the same relation as Eq. 20.20. 

Equation 20.19 is a second-order differential equation with constant coef- 
ficients. Although mathematical methods are available for the solution, we 
may solve it simply by trial substitution of an assumed solution. Try a solution 
of the form, 


xy = en (20.21) 


where a is a constant as yet undetermined. Differentiating Eq. 20.21 with 


APPLICATION OF THE SCHRODINGER THEORY 


respect to x, we obtain 


dx _ d ax — qe%X 

dx ax 

d*y d ; 
ee ax — eax — 2 
dx dx ° oa 


Substituting for d?y/dx? into Eq. 20.19 we have 


avy + ky = 0 
a2 +k? =0 
a= + ik 


The two solutions are therefore e“* and e~“*. The general solution will be the 
sum of these two or 


y(x) = ae + b e~ ike 
where a and DB are arbitrary constants. By mathematical definition 


a &* = acos kx + ia sin kx 
be ** = bcos kx — ib sin kx 
Therefore y(x) can be written as 


x(x) = (a + b) cos kx + i(a — b) sin kx 


and, because a and BD are arbitrary constants, we simplify the notation by 
introducing new constants A and B, 


x(x) = Acos kx + B sin kx (20.22) 


A and B are now the arbitrary constants to be determined from physical 
considerations. 

Thus far, no restrictions have been found as to the values that k and 
therefore E can take. However, as soon as we require that y be well-behaved, 
the restrictions on k and E will appear. 

The conditions of finiteness and single-valuedness on y and dy/dx are 
satisfied by the y of Eq. 20.22. The function is finite and single-valued for all 
values of x. 

The condition that y be continuous requires that Eq. 20.22 be 0 at x = 0 
and at x = a, because y is zero outside the well and from this we evaluate 
the constant A and the values of k. Because 


x(x) = 0 forx = 0 
then Eq. 20.22 becomes 
0 = A(1) + B(0) 
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It follows that 


A=0 4 16 E 

and therefore 

x = Bsin kx (20.23) ff j 
Furthermore, because 7 9E> 

y=0 forx=a 

then Eq. 20.23 becomes 2-4 Ep 

0 = B sin ka ; ; 
There are two possible ways to satisfy this latter condition. Either B = 0 or FAURE 209 


sin ka = 0. The first choice is not an attractive one; if B = 0, then y = 0 
everywhere. This means that the particle is not in the well. The only mean- 
ingful way to satisfy the condition of continuity is when 


The first four allowed energy lev- 
els for a particle in an infinite po- 
tential well. 


sinka = 0, thatis, ka = 0, 7, 2m, 3m,++° 


or 
qT 
k=n—  n=1,2,3,++° (20.24) 


Notice that n = 0 is not an acceptable choice. If n = 0, then k = 0 and from 
Eq. 20.23 x would be zero everywhere; that is, the particle could not be in 
the well. 

If we substitute the value for k from Eq. 20.24 into Eqs. 20.23 and 20.20, 
we will get a set of eigenfunctions and the corresponding set of eigenvalues 
that we can associate with the particle in the well. 


WT a iv 
Xn, = Bsinn A x (20.25) y, = Bsinn = x 
and from Eq. 20.20, 
_ AB _ na? He 
Wm a Om 


and the possible values of E are 


wh es Th" 
(20.26) cam 2ma? 


E, = n°Eo where Ep, = 
2ma* 
We should note that the first derivative of the eigenfunctions in this case is 
not continuous at x = 0 and at x = a. In this idealized example E, = ~ at 
x = Oand x = a, and the requirement that the first derivative be continuous 
does not hold. 
Let us analyze the results that we have obtained. The obvious result is 
that unlike the classical case of a particle bouncing between two walls, where 
the velocity and therefore the energy can have any value whatsoever, the 
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quantum mechanical result restricts (quantizes) the values of the energy. The 
first few allowed energy levels are represented schematically in Fig. 20-3. 
Thus, we see that in the Schrédinger theory energy quantization does not 
come in as a postulate; rather, it is a direct consequence of the fact that a 
particle is described by a wave and that the wave must satisfy certain con- 
ditions to be a good representation of the particle. 

The model that we have been discussing could be used to describe a 
particle or system of particles confined inside a container with rigid walls. 
Classically, the internal energy of such a system of particles can be controlled 
by changing the temperature. Recall that the average energy of an atom in a 
gas is E = 3/2 kgI (Eq. 9.21), where kg is the Boltzmann constant and T the 
absolute temperature. At absolute zero temperature the energy would be zero. 
In quantum mechanics, as we can see, this is not possible. Even at absolute 
zero the energy cannot be zero, because E = 0 is not an allowed energy state 
if the particle is in the well. The smallest energy that the particle can have is 
Eo. This result comes out of the solution of the Schrédinger equation, but we 
could have guessed it from the uncertainty principle. If E = 0, then p = 0, 
and therefore Ap = 0. From the uncertainty principle it follows that Ax = 
co, But Ax cannot be ~ because the particle is in the well; the largest Ax can 
be is a, the size of the well. It follows that Ap #0, hence p #0 and E #0. 

The solution of the Schrédinger equation has given us a set of eigen- 
functions y that can be used to describe a particle in the potential well. It 
does not tell us what particular y is associated with the particle. Which par- 
ticular y one assigns to it depends on how the particle was placed in the well 
and what is done to the particle afterward. If we leave the particle alone, it 
will, following the tendency of all physical systems, tend to go to the lowest 
energy state available, which is called the ground state or E = Ep (provided, 
as we will see later in Chapter 23, that that state is not already occupied by 
another particle). In the present case of lowest energy, the eigenfunction 
representing the x-position of the particle, will be, from Eq. 20.25 


v(x) = Bsin 7 x (20.27) 


We note that when the x(x) of Eq. 20.27 is multiplied by the time part of the 
wavefunction, I(t) of Eq. 20.16, the resulting wavefunction (x, t) represents 
a standing wave of the type discussed in Section 12.8. In fact, a plot (see Fig. 
20-4) of the first four eigenfunctions (n = 1, 2, 3, 4), Eq. 20.25, reveals that 
they are identical to the allowed standing waves in a string (Fig. 12-19). 


Example 20-1 Normalization of the Wavefunction 


The arbitrary constant B in Eq. 20.25 is determined by the normalization 
condition; that is, the probability of finding the particle somewhere in space 
must be 1. In mathematical terms this fact is stated as follows: 


[. uw dx = 1 


ee 


x = 0 x=a 


FIGURE 20-4 

Plot of the first four eigenfunc- 
tions for the particle in the infi- 
nite potential well. 
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Evaluate the constant B for the ground state wavefunction of a particle in a 
one-dimensional well, that is, for the wavefunction 


Ua(x, t) = x1(x) P(t) = B sin @ exp (- - ) 


Solution For the particle in the well the limits of integration must be 


from 0 to a because the propanny of finding the particle in the well must be 
unity. Therefore, 


a rere mele 1E _ 1Eo 2 
5B sin? (=) exp (521) exp ( Pt) ax 1 


The integral can be found in standard integration tables. From them, we get 


a am 1 27x\ )* 
— RB fe es, —— —_ 1 
T = 4 oo ( a )} 


Substituting the limits, we obtain 


OF 


Example 20-2. Probability Calculations 

We indicated previously that all the information about the particle can be 
found from the wavefunction. Consider a particle in the ground state, that 
is, one represented by the wavefunction of Example 20-1. What is (a) the 


average position, (b) the average momentum, and (c) the average energy of 
such a particle? 


Solution 


(a) From Eq. 20.6, the average position is given by 


= | vt xh dx 


(b 


~~ 


which in the present case becomes 


eI 


eed 


x 


x 


NG) 
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x=0 x 
FIGURE 20-5 
Example 20-2. 


This result makes physical sense and would have been predicted by 
looking at a graph of the probability density jy, which is plotted 
in Fig. 20-5. It is clear that the probability of finding the particle on 
the left side of the well is the same as the probability of finding it on 
the right side. The average position is therefore the midpoint x = a/2, 
which is the result found in the preceding calculation. This result is 
true not only for n = 1 but for all n’s. For example, the square of the 
eigenfunctions shown in Fig. 20-4 will be symmetric about the mid- 


point of the well. 


The average value of the momentum can be found using the method 
outlined in Section 20.2c of this chapter, Eq. 20.8. 
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(c) 


This result also makes physical sense. The particle is moving back 
and forth between the walls of the well. The probability of finding 
the particle moving toward the right is the same as the probability 
of finding it moving toward the left. Thus the average value of the 
momentum has to be zero. 


The average value of the energy E can be calculated using Eq. 20.10 


2 a (24 
E= ['w (® =) Wr, dx 
0 (, 8 z 
-|u (i 5) xa(x) exp (-iZ 
| E 
= i ut (ih) (-i =) Xi(x) exp (-iF4) dx 


= |" Eo ut th de 
E = Ej 


In the last step, we made use of the fact that i . ue uy dx = 1. The 


result should come as no surprise. We indicated that when the particle 
is in the ground state, the eigenfunction associated with the particle 
was given by Eq. 20.27. In this case the particle has a well-defined 
energy E = Ey. We expect therefore that the average value will be 


the actual value. 


PROBLEMS 


20.1 Show by direct substitution into the time-de- 
pendent Schrédinger equation for the free particle, 
Eq. 20.3, that w(x, t) = A cos(kx — wt) is not a so- 
lution. 


20.2 Show by direct substitution that the wave- 
function (x, ft) = A cos kx e~™ satisfies the time- 
dependent Schrédinger equation for the free parti- 
cle, Eq. 20.3. 


20.3 Explain why the following eigenfunctions are 
not acceptable solutions of the Schrédinger equation 


(a) x(x) =0 forx=0 
x(x) = Acoskx forx=0 


eikx 
(b) x(x) = A 


(c) x(x) = Aln kx 


20.4 We saw that there are two solutions for the 
eigenfunction of a particle in an infinite potential 
well, e* and e~**. Show that the sum y = a e** + 
b e~'** is also a solution, where a and D are arbitrary 
constants. — 


20.5 (a) What is the minimum energy of an electron 
confined in a one-dimensional well of width 1 A, 
that is, the approximate size of the atom? (b) How 
does this energy compare with the binding energy 


of an electron in an atom? (c) What is the minimum 
energy of an electron confined in a one-dimensional 
well of width comparable to the nuclear diameter, 
that is, 10~'* m? (d) The binding energy of the nu- 
clear constituents is a few MeV (10° eV), what can 
you conclude about the existence of electrons in the 
nucleus? 

(Answer: (a) 37 eV, (c) 3.7 X 10? eV.) 


20.6 (a) What is the minimum energy of a neutron 
confined in a one-dimensional well of width com- 
parable to the nuclear diameter, that is, 10~ * m? (b) 
Is it reasonable to assume that there are neutrons in 
the nucleus? The binding energy of the nu¢tear con- 
stituents is a few MeV (10° eV). 


20.7 (a) Under quantum conditions, what is the 
minimum energy that a 100-kg person can have when 
confined to a closet 1 m long? (b) If the person moves 
with a speed of 10~* m/sec, what would be his quan- 
tum number n? 

(Answer: (a) 5.4 x 10~7° J, (b) 3 x 10°.) 


20.8 Evaluate the energy of an electron inside a 
one-dimensional well of width 2 A for the states 
n = 1, 2, 10, 11, 100, 101, 1000, 1001. What conclu- 
sion can you draw about the fractional difference in 
the energy 


between adjacent states? 
(Answer: E, = 9.34 eV, E. = 37.47 eV, 
Exo = 9.34 x 10? eV, E,, = 11.30 x 10? eV, 
E100 = 9.34 X 104 eV, F401 = 9.53 x 10? eV, 
E,000 = 9.34 X 10° eV, E100. = 9.36 X 10° eV.) 


29.9 For the one-dimensional well, show that 


‘ baag ~ ae 
ni 


no Es 


= 0 


29.19 Anelectron in one of the excited states of the 
infinite potential well can emit photons when it de- 
cays to lower energy states. Assume that only tran- 
sitions in which An = +1 are allowed. Show that 
the frequencies of the photons emitted in these tran- 
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sitions are 


_ 3Eo 5Eo 7Ey On + 1)E 

TOA Ah ho 

ha 

2ma? 

29.1) Anelectron is confined in a one-dimensional 
well. Assume as in Problem 20.10 that only transi- 
tions in which An = +1 are allowed. The electron 
initially in the ground state absorbs a photon of fre- 
quency 2.4 x 10 Hz. (a) What is the energy spec- 
trum of the electron? (b) What is the width of the 
well? 

(Answer: (a) E,, = 0.33 n? eV, where n = 1,2,3,..., 
(b) 10.6 A.) 


where n = 1, 2,3,°°: and Ey = 


29.12. What is the probability of finding a particle 
in a well of width a at a position a/4 from the wall 
ifn = 1, ifn = 2, if n = 3. Use the normalized 
wavefunctions (x, t) = (2/a)'? sin naxla e7tEH*. 

(Answer: 1/a, 2/a, 1/a.) 


20.13 Calculate the expectation value for the co- 
ordinate x, the momentum p, and the energy E of a 
particle in an infinite potential well represented by 
the wavefunction of the first excited state, that is, 


W(x, t) = (2/a)" sin 2arxla e7 iF" 
(Answer: a/2, 0, E>.) 


20.14 An electron in an infinite potential well is in 
the third excited state, that is, y4 = (2/a)'? sin 4arx/a. 
What is the probability of finding the electron in the 
region between x = 0 and x = a/4? Answer the 
question directly from the graph of Fig. 20-4 without 
performing any calculations. Verify your answer by 
evaluating the integral 


a/4. 
p(osx=4) -| xi dx 


(Answer: 4.) 


29.15 AS we saw in Section 20-2c, we can associate 
the operator —ih o/dx with the momentum p of a 
particle. Similarly, we can associate the operator 
(—ih d/dx) (—ih d/ox) = — h? d?/dx* with the square 
of the momentum p”. Use this to find the expectation 
value of p?, that is, p*, for the lowest energy state of 
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the particle in the one-dimensional well. Use the 
normalized wavefunction of Example 20-1. (Hint: If 
you are thoughtful, there is no need to evaluate any 


integral.) ae 
(Answer: (%) .) 


20.16 Use the result of Problem 20.15 together with 
the fact that the kinetic energy of a particle is E, = 
1/2 mv* = p?/2m to evaluate the expectation value of 
the kinetic energy of a particle in the lowest energy 
state of the one-dimensional infinite potential well. 
How does your result compare with the energy Ep 
of that state? 


20.17 In Problem 19.21 we saw that the standard 
deviation o of a quantity is often used as a measure 
of the uncertainty in that quantity. The standard 
deviation of a quantity, for example, the momentum 
p is defined as 07 = p* — p’. (a) Use the result of 
Problem 20.15 for p? and that of Example 20-2 for 
p to find the uncertainty in the momentum of the 
particle in the lowest energy state of the well. 
(b) The particle can be anywhere in the well, there- 
fore Ax = a. Calculate Ap Ax, and compare the result 
with the uncertainty principle, Eq. 19.5. 

(Answer: (a) hala, (b) h/2.) 


20.18 Alpha particles (He nuclei) are emitted from 
certain nuclei. Consider an a particle with energy 
6 MeV (6 x 10° eV) in a nuclear well of width 
10~** m (see Fig. 20-6). How many collisions per 
second does it make with the walls? The mass of an 
a particle is 4 x 1.67 x 10-2’ kg. 

Epo E,0 


FIGURE 20-6 
Problem 20.18. 10-14 m 


20.19 The two-dimensional Schrédinger equation 
is 
h? (ax day 
—--—-(-S+-—-) +Ey,v=E 
2m = <4 pa oe 


Solve the Schrédinger equation for a particle con- 


fined in a two-dimensional infinite potential well of 
width and length a. In particular, show that the ei- 
genfunctions x and the corresponding eigenvalues 
E are given by 


NTx\ | (n 
Xan = Asin ( . ) sin ( a) 


2 


Th 
Enin = Oma (ni + n3) 


where n; = 1,2,3,... andn, = 1,2,3,... 


(Hint: Assume x(x, y) = y1(x) x2(y), and then use 
the separation of variables method discussed in Sec- 
tion 20-2d.) 


20.20 As we have seen in Chapter 19, one of the 
consequences of the uncertainty principle is that we 
cannot ascribe to a subatomic particle, such as an 
electron, a well-defined path. We can only talk about 
the spacial probability density |p|? = y*. If the par- 
ticle is a charged particle, then we can associate with 
its probability density a spacial charge density g|/2, 
where q is the charge of the particle. Consider an 
electron in an infinite potential well. Let the electron 
be in an eigenstate x,,. (a) Calculate the spacial charge 
density associated with that particle. (b) What con- 
clusion do you draw about the time dependence of 
that charge density? (c) What is the significance of 
this result in connection with Bohr’s second postu- 
late about the hydrogen atom? 

(Answer: (a) exy2(x).) 


20.21 When an electron is making a transition from 
some initial state w(x, t) to some final state W(x, ft), 
one cannot assert with certainty whether or not the 
transition has already taken place. As a conse- 
quence, the wavefunction describing the electron 
must be a mixture of the initial and final state wave- 
functions, that is, Y(x, t) = A(x, t) + Bys(x, t), 
where A and B are constants. Calculate the spatial 
charge density associated with such an electron. What 
is the physical significance of the result in connection 
with Bohr’s third postulate about the hydrogen atom? 

(Answer: e[A*y7(x) + B*x?(x)] + 


e 2 AB xi(x)x¢(x) cos (= P =!) a 
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21.1 INTRODUCTION 


Thus far, we have applied the Schrédinger theory to an ideal, one-dimensional 
case. One of the main features of the theory is that the quantization of the 
energy does not come in as a postulate but instead it comes naturally from 
the solution of the Schrédinger equation and the physically justified require- 
ments that the wavefunction be well-behaved. The great success of the Schr6- 
dinger theory, which is a postulate, but a fundamental one, is that one is 
able to derive from mathematical principles the postulates that had originally 
been presented to explain the classically unexplainable experimental results. 
Thus, if one solves the Schrédinger equation for a particle acted on by a linear 
restoring force F = —kx, that is, for a classical harmonic oscillator with po- 
tential energy E, = 3 kx*, such as a mass attached to a spring, the only 
physically acceptable solutions are those for which the energy E has values 
given by 


E, = (n + 3) hv wheren = 0,1, 2,3,... 


This result leads to the same conclusion as that of Planck concerning the 
energy of the electromagnetic spectrum produced by such oscillators, namely, 
Ewaves = nhv, and therefore the Schrodinger theory explains the spectrum of 
a blackbody without the need for a postulate. Note that the solution of the 
spectrum given here has a factor of 3 in it. This means that in the lowest 
energy state n = 0 a system still has vibrational energy. Therefore, even 
at a temperature of absolute zero, electrons, atoms, and such, will still vi- 
brate. 

Similarly, when one solves the Schrédinger equation for an electron in 
the potential of a proton (a hydrogen atom), one again finds that the only 
physically acceptable solutions for the wavefunctions are those where the 
energy E takes certain discrete values. The values are those postulated by 
Bohr, which in turn led to the explanation of the hydrogen spectrum. The 
Schrédinger theory provides much more information than only the quanti- 
zation of the energy spectrum. It explains why certain spectral lines are brighter 
than others; that is, it can be used to calculate transition rates. It explains 
some fine details about the spectrum; that is, some of the spectral lines under 
a high resolution spectrometer turn out to be two or more very closely spaced 
lines (two or more different wavelengths differing by a few angstroms). 

The mathematical complexities do not permit us to solve in detail the 
Schrédinger equation for the coulomb potential of the hydrogen atom. We 
will, however, outline the solution of the problem and present the results 
that are obtained by solving the Schrédinger equation. We will then discuss 
the physical significance of these results and use these results together with 
a very important principle of quantum mechanics, the exclusion principle, to 
gain some understanding of the ground state of multielectron atoms. Then 
we will be ready to apply the quantum theory to solids. 
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21.2 OUTLINE OF THE SOLUTION OF THE 
SCHRODINGER EQUATION FOR THE H ATOM 


We saw in connection with the Bohr model of the hydrogen atom, Eq. 18.1, 
that the potential energy of the electron in the electric field of a nucleus is 


1 2 
E,=-——= 


ATrey r 


where r = (x? + y? + z?)'? is the distance between the electron and the 
proton. If the nucleus has more than a single positive charge we must include 
the additional charges in the Coulomb energy. We do this by a multiplicative 
term Z, called the atomic number, that represents the number of protons in 
the nucleus and also the number of electrons in neutral atoms. Thus the 
charge part of the Coulomb equation for a single electron becomes eZe or Ze?, 
and the above energy equation, which is for Z = 1, is written as 

1 Ze? 

E, = - — (21.1) 


ATE r 


We extend the time-independent Schrédinger equation (Eq. 20.17) to a 
situation where a particle is free to move in three dimensions. To do this we 
simply include derivatives of the space part of the wavefunction, y with 
respect to the other two coordinates, y and z, namely, 

07 0? 07 

~ Fe ee + Ey (x,y, xX = EX 

This is a partial differential equation and, because E, is a function of x, y and 
z, it is difficult to solve in the form shown here. However, because E, is a 
function of the separation between the protons and the electron only, it is 
much simpler to solve the equation if we write it in spherical coordinates (see 
Fig. 21-1). In spherical coordinates the Schrédinger equation becomes 


1 a oY 1 of. .oy 1 oy 
eo | a ee 0) ; 
r? or (: *) r?sin@ 060 (sin *) rsin20 dd i 
2m _ 
+ 55 IE - Ext) x = 0 (21.2) : 
This is an awesome looking equation, but one that can be solved by the | _ Pe 
standard technique of separation of variables; we try a solution of the form Pe ee 
x(7,9,4) = R(r) (8) ®4) wil ™™ 
FIGURE 21-1 


where R, 0, and @ are functions of only one coordinate. On substitution into Spherical coordinates r, 0, and 
Eq. 21.2, rearrangement of terms, and division of both sides of the resulting of a point P. 
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equation by R(r), 6(6), and ®(¢), we obtain 


sin*0 d /( ,dR DHE yc 
R dp ¢ z Pe r* sind [E — E,r)] 


dO 1 d*® 
sin “et 1d*® (1.3) 


6 do\” do) Ode 


The left side of Eq. 21.3 is a function of r and 6 alone, whereas the right side 
is a function of ¢ alone. The only way the equation can be valid is when both 
sides of the equation are equal to the same constant. For convenience, we let 
this constant be —m?, where this m, is not to be confused with the m used 
for mass. The right side of the equation gives us an ordinary differential 
equation for ® 


ae SS (21.4) 


The constant is chosen as — mf in order that m, will be an integer, as will be 
indicated later. The left side of Eq. 21.3, after being set equal to the constant 
— mf and rearranged, may be written as 


1d > aR 2mr2 
aay re pace Ba 
R dr ¢ 7) +t | pA?) | 


_ mM 11 da/f/. 9 dO 

~ sin? © sin 0 d6 (sin 4 
Again, one side of this equation is a function of the variable r only, whereas 
the other side is a function of the variable 6 only. The only way the equality 
can be valid is when both sides are equal to the same constant. If we call this 
constant / (J + 1), / will be an integer. We now get two differential equations, 
one for R(r) and another for O(6). 


mz 1 df{. ,dO\ | 
sin? 6 O sin 640 (sin : 4 mieeieste 2) ot) 
and 
1d /_,dR 2mr? 
(2 E-E(n}=10 +1 | 
Rap ¢ + 4D [ r(1)] (1 + 1) (21.6) 


The next step is to solve the three differential equations (21.4, 21.5, and 21.6). 
These three equations and their solutions are well known in mathematics. 
We will simply mention what happens when one solves them subject to the 
constraints on the wavefunctions discussed in Chapter 20. 


1. When we solve Eq. 21.4, we find that the only solutions for ® that are 
single-valued are those for which (see Problems 21.1 and 21.2) 


m, = 0, +1, +2,... (21.7) 


PHYSICAL SIGNIFICANCE OF THE RESULTS 


2. When we solve Eq. 21.5 for 0, we find that the only solutions that are 
finite everywhere are those for which 


1=0,1,2,... (21.8) 


and 


|= |m|| 

3. Finally, when we solve Eq. 21.6, we find that the only solutions for R that 
remain finite everywhere are those for which 

Z7e*m 1 

5, = etm 1 


Gere aaa 


(21.9) 


and 


l<n 


The restrictions on the values that m, and / can take can now be restated 
as follows: 


(21.7a) 


and 
(21.8a) 


21.3 PHYSICAL SIGNIFICANCE OF THE RESULTS 


The most important result of the solution of the Schrédinger equation for the 
H atom is the fact that the energy of the atom is quantized (Eq. 21.9). The 
spectrum of energy levels is the same as the one postulated by Bohr. Because 
the energy depends only on the quantum number 1, it is called the principal 
quantum number. 

The spacial variation of x depends, however, on the three quantum num- 
bers n, l, m,, which arise from the integers of Eqs. 21.7a, 21.8a, and 21.9, and 
the wavefunction is written with the quantum numbers as subscripts, Xn1m,. 
Because for a given n the other two numbers can take several values, this 
means that it is possible for the electron to have quite different characteristics 
while maintaining the same energy. States y having the same energy but 
different values for the quantum numbers ! and m, are called degenerate states. 
The degree of degeneracy depends on the value of n. For example, from Eqs. 
21.7a and 21.8a, if n = 1, 1 = 0, and m, = 0; we have only one state Yj0. If 
n = 2,1 = 0or1; for! = 0, m, = 0; for! = 1, m, = 0, 1, or —1. Thus for 
n = 2, we have a fourfold degeneracy with wavefunctions X20, X210, X211, 
and 21-1, all having the same energy. This degeneracy is often represented 
(partially) in a two-dimensional diagram (see Fig. 21-2). The vertical axis shows 
the quantum number n. Equation 21.9 shows that as n increases E becomes 
less negative and therefore larger. The horizontal axis is the quantum number. 
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FIGURE 21-2 

Partial representation of the de- 
generacy of the eigenfunctions 
in the hydrogen atom. The en- 
ergy E is determined by the 
quantum number n alone. The 
spacial variation of the eigen- 
functions depends on the three 
quantum numbers n, 1, m,. States 
with the same n but different / or 
m, have the same energy and are 
said to be degenerate states. The 
degeneracy with respect to m, is 
not shown in the figure. 
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States for which ! = 0 are called s states, those with] = 1 are called p states, 
those with | = 2 are called d states, those with | = 3 are called f states, 
following the initials of the original names given by spectroscopists to groups 
of spectral lines; sharp, principal, diffuse, fundamental. 

Although further details of calculations will not be shown, certain results 
of these calculations are important and will be listed and discussed. 


1. The quantum number ! is called the orbital quantum number, because | 
determines the magnitude of the angular momentum L of the atom. 


2. The quantum number m, is called the magnetic quantum number, because m, 
determines the orientation of the angular momentum L in a magnetic field. 
It can be shown that if an atom is placed in a magnetic field directed along 
the z direction, the z component of the angular momentum L of the atom 
is given by 


L, = mh (21.10) 


To have a feeling of what we mean by the orientation of L, we must remember 
that in order to specify the state of rotation of an object, it is not enough to 
say how fast the object is rotating, we must also specify the plane of rotation, 
that is, we must give a direction to L. The convention is that the vector L is 
perpendicular to the plane of rotation, and the sense is given by the right- 
hand rule, Eq. 8.25. The result of Eq. 21.10 tells us that in an atom, L cannot 
have any arbitrary orientation with respect to the z axis (as would be the case 
classically), but rather it can have only certain discrete orientations. This is 
known as space quantization. Suppose ! = 2, then m, can be 2, 1, 0, —1, —2. 
Thus L, = 2h, h, 0, —h, —2h. See Fig. 21-3. 

So long as there is no preferred direction, space quantization is mean- 
ingless. If, however, you perform an experiment in which a particular di- 
rection becomes preferred, such as by the application of a magnetic field, 
then direction becomes meaningful. If the field tends to align the angular 
momentum of the atom along a given direction, then that direction becomes 
the z axis. 

Is this prediction of the Schrédinger theory concerning space quantization 
borne out by the experiment? We will now examine the behavior of an atom 
in a magnetic field. 


21.4 SPACE QUANTIZATION: THE EXPERIMENTS 


21.4a The Zeeman Effect 


We know that if a current 7 flows through a loop of area A, a magnetic dipole 
moment mw = 7A is associated with it (see Eq. 16.10 et. seq.) If we place this 
dipole in an external uniform magnetic field B (see Fig. 21-4), it will experience 
a torque 7 = pw X B = w B sin 6, where @ is the angle between the two 
vectors, Eq. 16.12. The torque is a maximum when the dipole is perpendicular 


FIGURE 21-3 

Possible orientations of the an- 
gular momentum L of the elec- 
tron in the hydrogen atom for 
the case where the orbital quan- 
tum number / = 2. 


CL. = mh 


FIGURE 21-4 
A magnetic dipole moment 
p = iA ina magnetic field B. 
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to the field and zero when it is parallel or antiparallel to it. As a consequence, 
we can associate with such a dipole mu in a magnetic field B a potential energy 
that will depend on the orientation of w with respect to B. The potential 
energy E,, is, 

E, = — m:B = pBocos 0 (16.14) 


Let us consider an electron in one of the Bohr orbits (see Fig. 21-5); the 
circulating electron represents a current i given by 


, _ charge _ lel _ lel _ lelv 
time T 2ar 2a1r 
v 


where T is the period of rotation, v is the tangential speed of the electron, 
and r is the radius of the orbit. Then 


In quantum mechanics we cannot talk about a particle moving in a circular 
loop of radius r with a velocity v. We can rewrite this expression in a form 
that is meaningful quantum mechanically by multiplying the numerator and 
the denominator by the mass of the electron and recalling that mur is the 
angular momentum L of a particle of mass m rotating with a tangential 
velocity v about a point a distance r away. 

lelrv _ |elmor __ el 


2 2m 22m 


le| 

M= 35 L (21.11) 
We have included a minus sign because the direction of i is opposite to the 
direction of motion of the electron, hence of L (see Fig. 21-6); and we have 
written yt with the subscript / to indicate that the dipole is associated with 
the orbital motion of the electron. Thus, we have established that an atom 
with angular momentum L has associated with it a magnetic dipole moment. 
If we now place the atom in a magnetic field B, there will be an extra con- 
tribution to the energy given by Eq. 16.14 


E, = — or = 7 L-B= 


(Note that this is not the Coulomb potential energy term of Eq. 21.1.) From 
Fig. 21-6, L cos 0 = L,, therefore 


E, = rel BL, (21.12) 


The total energy of the atom, Erota, will be the sum of the energy resulting 
from the interaction of the electron with the nucleus, that is, E,, of Eg. 21.9, 


FIGURE 21-5 
An electron orbitting the nucleus 
in one of the Bohr orbits. 


FIGURE 21-6 
Relative orientation between the 
angular momentum of the elec- 
tron L and the associated mag- 
netic dipole moment st. 
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and the energy resulting from the interaction of the dipole moment with the 
magnetic field B, that is, E, of Eq. 21.12. 


Erotar = En + Ey (21.13) 


Classically, L, can take any value between L and —L. This means that an 

energy level E,, would become a band of width 2 (|e|/2m) (BL) (see Fig. 21-7). 

Quantum mechanically, however, L, can take only certain values, that is, 

L, = mh. Thus the total energy will be given by 
lel 


Fvrotal = ioe + -— Bh Mm 


- (21.14) 


Because m, takes values 0, +1, +2, ... +1, the energy levels of quantum 
states with orbital quantum number ! larger than 0 and thus, with several 
values for m), split into several discrete sublevels (see Fig. 21-8). If we now 
reexamine the spectrum of hydrogen (Eq. 21.9, Fig. 21-2), then = 2ton = 1 
transition should give rise to three different lines (three different A’s). In 
the transitions from higher n states, we may expect even a greater number 
of lines. However, there is a selection rule that initially was found experi- 
mentally and was later derived from the Schrédinger theory which states that 
the only allowed transitions are those for which Am; = 0 or +1. Thus, re- 
gardless of the number of sublevels, transitions from a given n to another 
should give rise to only three distinct lines. This effect is known as the normal 
Zeeman effect. It had been observed experimentally in the spectrum of such 
elements as helium, calcium, zinc, and a few others long before quantum 
mechanics was formulated. Often more than three lines are observed. For 
example, in hydrogen the n = 2 to n = 1 transition shows seven closely 
spaced lines. This effect is known as the anomalous Zeeman effect. Thus, the 
experiment shows that the idea of space quantization is correct. However, it 
also shows that the Schrédinger theory as originally formulated was incom- 
plete. In Section 21.5 we will show what was necessary to complete the theory. 
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Example 21-1 


Calculate the normal Zeeman splitting of the calcium 4226 A line when the 
atoms are placed in a magnetic field of 1.2 T (tesla). 


FIGURE 21-7 

Classically, the energy levels of 
the electron in the atom should 
become energy bands of width 
2 eBL/2m when placed in an 
external magnetic field B. 


FIGURE 21-8 

Schematic representation of the 
splitting of the 2p state of the hy- 
drogen atom into three Zeeman 
energy levels corresponding to 
the three possible orientations 
(three possible values of m,, 0, 1, 
—1) of the angular momentum 
L with respect to the external 
magnetic field B. Note that the 
two s states are not affected by 
the magnetic field because for 
those states m, = 0. 
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Solution From Eq. 21.14, the energy between adjacent Zeeman levels is 
given by 
lel 
AE = — Bh 
2m 


1.6 x 10°'%C x 1.2T x 1.05 x 10~* J-sec 
2x 9.1 x 107%! kg 


1.11 x 10° J = 6.92 x 107° eV 


The relation between the energy difference of two levels and the associated 
difference in the wavelength of the photons emitted as a result of transitions 
from those levels to the same final state can be found by using Einstein’s 
relation: Epnoton = hv = hc/d. Differentiating this expression with respect to 
the wavelength, we obtain 


he 
dE = —- 2 dx 
or 
he 
JaE| = 55 (aa 


Because the shift in energy, AE, is small compared with the energy of the 
level, we can make the approximation dE ~ AE. Thus, 


r2 
AA| = — 
JAa| = 7 AE 


_ (4.226 x 1077 m)? x 1.11 x 10°* J 


= = 9.96 x 10-2 m = 0.099 
6.63 X 107%4 J-sec x 3 x 108 m/sec m 6A 


21.4b Stern-Gerlach Experiment 


Another experiment that shows the idea of space quantization of the Schré- 
dinger theory as being correct although incomplete is the Stern-Gerlach ex- 
periment. 

We know that when a magnetic dipole is placed in a uniform magnetic 
field B it experiences a torque. However, if the field B is not uniform, the 
dipole will also experience a net force. Figure 21-9 is a schematic of a non- 
uniform magnetic field where the B arrows represent separate parts of the 
magnetic field where the vector sum at any point is the total magnetic field 
at that point. Because the density of B lines represents the strength of B, it 
is clear from the figure that B increases in the z direction; thus there is gradient 
dB/dz along the z axis. It is also clear from the picture that a net force directed 
upward will act on a magnetic dipole placed in the field. In general, it can 
be shown that if there is a magnetic gradient along the z axis the force will 
be directed along the z axis and the magnitude of the force will depend on 


Pieter Zeeman (1865-1943). 


FIGURE 21-9 

Forces on a current-carrying loop 
(a magnetic dipole) placed in an 
inhomogeneous magnetic field 
with a gradient dB/dz in the z di- 
rection. 
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the magnitude of the gradient and on the orientation of pw, with respect to 
the gradient. The actual value of the force is 


dB 
FL = mh 


Po eh 


where 9 is the angle between pw, and the gradient dB/dz. From Fig. 21-10, 
bt; COS 8 = py, 


dB 
PF, = pr rs FIGURE 21-10 
Z 


From Eq. 21.11 we obtain 


(21.15) 


Classically, L, can have any value between L and —L,; that is, if you send a 
beam of magnetic dipoles 1; through such an inhomogeneous magnetic field 
B(z), they will experience a force that can take any value between 


ld 4B ang LB 
2m dz ene 2m : dz 


In the experiment, Stern and Gerlach sent a beam of neutral silver atoms 
from a heated oven through a collimator and then through a region with a 
nonuniform magnetic field B(z) with a gradient along the z axis (see Fig. 
21-11). The silver atoms were collected on a screen. Although the atoms were 
uncharged, they did possess a magnetic dipole. Therefore, the only force 
acting on them was the one resulting from the inhomogeneity of the magnetic 
field. Depending on the magnitude of this force (that is, on the orientation 
between p, and dB/dz), the atomic magnetic dipoles will be deflected as they 
travel through the magnet. The stronger the force, the greater the deflection, 
and therefore the farther away from the center they would land on the screen. 


Collimators Magnet 


Screen 


Oven 


Ag 
atoms 
Cross-sectional view ae in FIGURE 21-11 
of the magnet Beam- | 


(end view) Ad A \ Schematic of the experimental 


setup used in the Stern-Gerlach 
experiment. 


(Side view) 


Because from classical physics the force F varies continuously between the 
two values mentioned earlier, one would expect that the distribution of atoms 
on the screen would be more or less uniform between some Z,,,, and —Zmax/ 
that is, we would expect the distribution to be of the form shown in Fig. 
21-12. On the other hand, if the Schrédinger theory is correct, and L, is 
quantized, then the force in the z direction, F,, can have only a discrete number 
of values, one for each m,. We would therefore expect that the atoms would 
land at some discrete distances z from the center of the screen. The experi- 
mental results for silver (Ag) atoms were as shown in Fig. 21-13. Because all 
atoms landed at a given distance z above zero or at the equivalent distance 
—z below zero, this result shows that the force F, was either plus or minus 
a given value and that L, had only two possible values. The experiment 
confirmed the concept of space quantization. This experiment was repeated 
by Phipps and Taylor with hydrogen atoms, and the results were identical 
in form to those shown in Fig. 21-13. 

Although the experimental results showed splitting, which demonstrates 
space quantization, they were in quantitative disagreement with the Schré- 
dinger theory. Because the magnetic quantum number values are m, = 0, 
+1, +2,... +l, there will be (21 + 1) possible values, and because / is an 
integer, 2] + 1 is an odd integer. Therefore, m, should take an odd number of 
values, and one of them should be 0. Because m, determines L, and hence F,, the 
same statement applies to F,. In fact, in the case of hydrogen the expected 
experimental result is obvious: one line at z = 0. The reason is that at the 
temperature of a few hundred degrees, all the atoms should be in the ground 
state, which is called the 1s state (characterized by the quantum numbers 
n = 1 and/1 = 0). In this state 1 = 0, and therefore m, = 0 and uw, = 0. 
However, because the Phipps-Taylor experiment on hydrogen showed two 
well-defined z spacings and no value at z = 0, it is evident that there is a 
magnetic dipole other than the orbital dipole that has been overlooked. We will 
next consider this new modification. 

One final piece of experimental evidence that illustrates this oversight is 
the following: In the absence of an external magnetic field, the transition in 
hydrogen from n = 2 ton = 1 should give rise to just one line, one 4, Fig. 
21-2. This transition in the absence of a magnetic field is shown in Fig. 
21-14. Actually, the experimental observation of this spectral line shows two 
very closely spaced wavelengths that correspond to two very close frequen- 
cies, that is, energies. This phenomenon is known as the fine structure of a 
spectral line. 


21.5 THE SPIN 


What has the original Schrodinger theory overlooked? It was while studying 
the fine structure of the spectrum that G. Uhlenbeck and S. Goudsmit pro- 
posed the idea of the electron spin: The electron has an intrinsic angular mo- 
mentum called the spin S. Just as the orbital angular momentum L has a magnetic 
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FIGURE 21-12 

Expected distribution (according 
to classical principles) of the sil- 
ver atoms as they land on the 
screen of the experiment in Fig. 
21-11. The z = 0 position corre- 
sponds to atoms that go through 
the magnet undeflected. 


2 


FIGURE 21-13 

Experimentally observed posi- 
tions of the landings of silver at- 
oms on the screen in the exper- 
iment of Fig. 21-11. 


ss 0D 


| 
a B=0 


FIGURE 21-14 

Fine structure. In the absence of 
an external magnetic field, tran- 
sitions in hydrogen atoms from 
the state n = 2 to the state n = 
1 should yield photons of a sin- 
gle frequency because the 2s and 
the 2p states are degenerate states. 
Two slightly different wave- 
lengths are observed. 
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dipole associated with it, so does the spin; that 1s, the electron has a spin dipole moment 
pe; = — e/mS. By analogy with the behavior of L and in order to explain the 
experimental results, they postulated that the magnitude of S and its z com- 
ponent were quantized as follows 


S = [s(s + 1)]'2h% wheres = 3 


- = wed 
S. = m,h where m, = +34 


The existence of the spin and its behavior enter as a postulate in the Schro- 
dinger theory. The reason is that this theory is nonrelativistic. In the relativistic 
quantum mechanical theory developed later by P. Dirac, the existence of the 
spin and the rules governing its behavior are a natural consequence of its 
formulation. The development of relativistic quantum mechanics is beyond 
the scope of this book. 

Using the spin postulate, the experimental results concerning the anom- 
alous Zeeman effect, the Stern-Gerlach experiment and the Phipps-Taylor 
experiment, as well as the fine structure, can be explained. For us the main 
conclusion is that the state of an electron is now specified by four quantum numbers: 
n, l, m, m,. Note that the quantum number s is 3 for all individual electrons 
and thus we do not need to specify it further. 


Example 21-2 

A beam of hydrogen atoms is used in a Stern-Gerlach type experiment. The 
atoms emerge from the oven with a velocity v = 10* m/sec. They enter a 
region 20 cm long where there is a magnetic field gradient dB/dz = 3 x 
10* T/m. The field gradient is perpendicular to the incident velocity of the 
atoms. The mass of the hydrogen atom is 1.67 x 10~* kg. What is the sepa- 
ration of the two components of the beam as they emerge from the magnet? 


Solution In the ground state, hydrogen atoms have no net orbital mag- 
netic dipole moment. The only dipole moment is the one associated with the 
spin of the electron in the 1s state, that is, uw, = — |el/m S, where m is the 
mass of the electron. From our previous discussion of the Stern-Gerlach ex- 
periment, 
dB | dB fe dB 


Fa = Me a= maz ~ 2m. dz 


We can use Newton’s second law to find the acceleration a, of the hydrogen 
atoms as they traverse the magnet. 


e| h — 
F, le dz 
Matom 2 m Matom 


1.60 x 107!9C x 1.05 x 10°-*4 J-sec x 3 x 104 T/m 
2x 9.1 x 107-3! kg x 1.67 x 10°?” kg 


1.65 x 10° m/sec? 


5S. = #4, fi 


where mi. = 


SOME FEATURES OF THE ATOMIC WAVEFUNCTIONS 333 


The deflection of each component in the direction of the force (z axis) will be 


1 
Az = 5 act 
where ¢ is the time that the atoms spend in the magnet. This time can be 
found by dividing the length of the magnet by the incident velocity of the 
atoms 
0.20 m 


= —~— =2x 10-5 
10* m/sec as 


Therefore 
Ao = ; x 1.65 x 108 m/sec? x 4 x 107!9 sec? 


= 3.3 x 10°? m = 3.3.cm 


Because of the two possible values for m,, some atoms will be deflected upward 
and some downward. Therefore, the separation between the two components 
of the beam will be 2 Az or 6.6 cm. 


21.6 SOME FEATURES OF THE ATOMIC WAVEFUNCTIONS 


We now have almost all the information necessary to understand the impor- 
tant features of multielectron atoms and to proceed to the study of solids. A 
couple of additional facts that we need are found by looking at the wave- 
functions obtained in the solution of the Schrédinger equation for the hy- 
drogen atom. 

Figure 21-15 shows simulated densities representing |x|? as a function of 
x, y, and z for the lowest energy states. One important feature is that, except 
for s states (that is, 1 = 0), |y/* does not have spherical symmetry; it has axial 
symmetry (a consequence of L, having a definite value). However, if we take 
a subset of states made up of states having the same n and | but different m, 
and we add up |y/? for all those states, we will find that the sum has spherical 
symmetry. This can be shown directly by taking the actual wavefunctions 
and then evaluating the sum 


% 
>» X nlm, Xnim, 


my 


This sum is independent of the orientation angles @ and @. But the result can 
be seen to be true by simply looking at Fig. 21-15. It turns out that the sum 
of all the wavefunctions of a subset of states having the same n and | is 
spherically symmetric not only for the hydrogen atom wavefunctions but for 
those of multielectron atoms as well. What this means is that if you have one 
electron in each of the m, states associated with a given I, the resulting charge 
distribution will have spherical symmetry. This is important because it 
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n=3,l=2, m,=0 


FIGURE 21-15 

An artist simulation of the prob- 
ability densities |y|? as a function 
of x, y, and z for the lowest en- 
ergy states of the hydrogen atom. 
The vertical lines correspond to 
the z axis. (Source: Robert Eisberg 
and Robert Resnick, Quantum 
Physics of Atoms, Molecules, Sol- 
ids, Nuclei, and Particles, 2nd ed. 
Copyright © 1985 by John Wiley 
& Sons, Inc. Reprinted by per- 
mission of John Wiley & Sons, 
Inc). 
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explains partially why the rare gases helium (He), neon (Ne), argon 
(A), ... are inert. It will also allow us to understand why atoms such as 
lithium (Li), sodium (Na), potassium (K), . . . are hydrogen-like. We will use 
these facts later. 

Another important feature of the wavefunctions can be found by looking 
at Fig. 21-16. It shows the radial probability density, P(r), where P(r) dr is the 
probability of finding the electron between r and r + dr (that is, summed 
over all angles). These curves are obtained by integrating the probability 
density per unit volume *w over the volume enclosed by the mathematical 
spherical shells with radii r and r + dr, respectively. As can be seen, P(r) 
varies with the distance from the nucleus r, with a maximum at certain r 
values that depends on the particular wavefunction. The horizontal axis is in 
units of r9¢, where ro is the radius of the smallest orbit in the Bohr model. 
Notice that an electron in quantum state n = 1 has an average r value of 
about ro; an electron in state n = 2 has an average r ~ 4 ro, and so on. Looking 
at it differently, an electron in state n = 2 is more likely to be found outside 
the region that would be occupied by an electron in state n = 1. An electron 
in state n = 3 is more likely to be found outside the region occupied by an 


FIGURE 21-16 

Radial dependence of the prob- 
ability density (probability of 
finding the electron at a certain 
distance from the nucleus re- 
gardless of the angular position) 
versus distance from the nucleus 
for the lowest energy states of 
the hydrogen atom. The trian- 
gles on the horizontal axes in- 
dicate the average value of r for 
that state. (Source: Robert Eisberg 
and Robert Resnick, Quantum 
Physics of Atoms, Molecules, Sol- 
ids, Nuclei, and Particles, 2nd ed. 
Copyright © 1985 by John Wiley 
| EEs. “Saeoe & Sons, Inc. Reprinted by per- 
5 10 15 20 25 mission of John Wiley & Sons, 
Smee 7 (ih UNITS Of 7G) Inc). 
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electron in states n = 1 and n = 2, and so on. But an electron in state n = 
2,1 = 0 has a small but finite probability of being inside the space occupied 
by the electron in state n = 1, that is, being very close to the nucleus. This 
tendency is less pronounced for the electron in state n = 2, / = 1. Similar 
remarks apply for the three states with n = 3. An electron with n = 3,1 = 
0 has a small chance of being inside the region occupied by electrons in state 
n = 2 and also a chance of being very close to the nucleus inside the region 
occupied by electrons in state n = 1. The electron with n = 3,1 = 1 shows 
the same tendency for the region occupied by electrons with n = 2 but not 
for the electrons with n = 1. Finally, electrons with n = 3,1 = 2 do not have 
either of the two tendencies. We can summarize this effect as follows: For a 
given state n, the electrons in the lower angular momentum states are more likely to 
be found near the nucleus than those in states with higher angular momentum. In 
the case of hydrogen, this effect is of no consequence; states with the same 
n and different | are degenerate and therefore have the same energy. In the 
case of a multielectron atom, this effect is important because it will explain 
why states with the same n but different / have different energies. This fact 
is important in understanding some of the general features of the periodic 
table. 


21.7 THE PERIODIC TABLE 


21.7a Pauli’s Exclusion Principle 


The lowest energy state of an electron is called the ground state. In the hy- 
drogen atom this is specified by the following quantum numbers. 


n= 1,1=0,m, = 0, andm, = 4or — 3 

What happens in an atom with 70 electrons? Are all 70 electrons in a state 
specified by these four quantum numbers? The experimental evidence says 
not. This conclusion is reflected in Pault’s exclusion principle, which states: No 
two electrons in a system (be it an atom or a solid) can be in the same quantum state. 
That is, no two electrons can have identical values for the set of quantum numbers 
specifying the state, which in the case of an atom are n, l, m,, m,. At least one of the 
quantum numbers must be different. 

Professor Banesh Hoffman of the Queens College Mathematics Depart- 
ment has put it in a very colorful way: “It is as if the atom were a large city 
where electrons live in separate apartments. Each apartment has a different 
address, one quantum number indicating the street, another the house, a 
third the floor, and the fourth the apartment. These four quantum numbers 
are then the complete address of each apartment, and Pauli’s principle is a 
regulation against overcrowding.” 


‘The Strange Story of the Quantum. Banesh Hoffmann. Dover Publications Inc. N.Y. (1959). 


Wolfgang Pauli (1900-1958). 


Many physical and chemical properties of the elements are periodic functions 
of the atomic number Z, which is the number of positive charges (protons) 
in the nucleus and is also equal to the number of electrons for neutral atoms. 
This periodicity has to be taken loosely, in the sense that the period is not 
constant. But the fact remains that elements such as H, Li, Na, and K have 
very similar properties and so do elements such as He, Ne, and Ar. A standard 
form of the periodic table is presented in Fig. 21-17. If we leave aside for the 
moment the central block (see the scandium to zinc part of this section), the 
table is divided into eight groups (indicated by roman numerals), all elements 
in a group are in the same column. H, Li, Na, ... form the first group; 
beryllium (Be), magnesium (Mg), calcium (Ca) . . . belong to group II; boron 
(B), aluminum (Al), gallium (Ga) . . . form group III; and so on until we get 
to He, Ne, Ar, . . ., which form group VIII. Elements in a group have very similar 
chemical and optical properties. The reason for this is that these properties are 
determined by the electron configuration. Atoms in a group have a similar that the chemical and physical 
electron configuration, as we will soon see. properties of the elements are 

In addition to dividing the elements into groups, we can also divide them periodic functions of the atomic 
into periods (indicated by arabic numerals in Fig. 21-17). H and He form the | number Z. 


Dimitri Mendeleev (1834-1907) 
was the first to indicate in 1869 


Periodic table of the elements. . 
roup 
Group I KEY Vil 


Atomic number 
y “ae ~ = ar 
4.003 


Atomic weight* 


| 32 
A008 44.96 | 47.90 | 50.94 | 52.00 | 54.94 | 55.85 | 58.93 | 58.71 | 63.55 | 65.38 74.92 | 78.96 | 79.90 | 83.80 
4] : 
85.47 a 88.91 | 91.22, 229) 95.94 | (99) | 101.1 | 102.9 | 106.4 | 107.9 | 112.4 aise 118.7 
a 192.2 | 195.1 


Period 


tans fe wi “5 | 
Series 

\ 138.9 | 140.1 3 144.2 pas Js 152.0 J 158.9 62s ete 3 | 168.9 | 173.0 
Actinide ‘ 


Series 


89 95 100 iol ‘102 103 
(227) | 232.0 | 231.0 | 238.0 | 237.0 | (244) | (243) | (247) | (247) | (251 | (254) | (253) | (256) | 25a) _| (257) 


"The number in( ) = Mass Number of the most stable isotope. 
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first period; Li, Be, B, . . . Ne form the second period; Na, Mg, Al, ... Ar 
form the third period, and so on. The chemical properties of the elements in 
a period vary progressively as we go from left to right. 

Let us now examine the electron configuration of some different atoms 
in order to understand their chemical behavior. We must keep in mind two 
guiding principles: | 


1. The electron configuration must satisfy the exclusion principle. Because 
for each quantum number I, the value of m; has (27 + 1) possible values 
and, because for each value of m;, the value of m, can be +3 or —3, we 
have room for 2(2/ + 1) electrons in each subshell. (A subshell is the set of 
states having the same n and / quantum numbers.) Thus, for example, the 
s subshells can accommodate 2 [(2) (0) + 1] = 2 electrons; p subshells can 
accommodate 2 [(2) (1) + 1] = 6 electrons; d subshells can accommodate 
2 [(2) (2) + 1] = 10 electrons, and so on. 


2. The ground-state configuration is the one in which the total energy is a 
minimum. This means that we must know the energy of each electron 
when it is in a given state, that is, in a state characterized by a given n 
and /. In the case of H the energy of the atom is caused by the interaction 
between a single electron and the single positive charge in the nucleus. In 
multielectron atoms the energy is caused by the interaction of Z electrons 
with the positive nucleus of charge + Zle| and the interaction of the Z 
electrons among themselves. (Note that for an atom to be electrically neu- 
tral the number of electrons must equal the number of positive charges.) 
The calculation of the energy associated with these interactions is a long 
and tedious task that cannot be solved analytically but can be solved by a 
numerical method. However, important results from these calculations can 
be understood with intuitive arguments. 


a. The total energy of an electron in a multielectron atom becomes less negative 
(increases) with increasing n. The increase from a given n to the next is less 
pronounced as we go to higher n’s. Everything being equal, the state n = 1 
has a lower energy (is more negative) than the state n = 2, the state 
n = 2 is lower than n = 3, and so on. This can be seen from Fig. 21- 
16, in which higher n is associated with larger r, and therefore the 
electrons in increasing higher n states have decreasingly lower binding 
energies. In addition, it can be seen from Fig. 18.4 that AE,., > 
AE,-,3, and so on. We saw this is true for hydrogen, and calculations 
show that it is also true for multielectron atoms. 


b. For a given n, the electron with the lowest | has the lowest energy. In the case 
of hydrogen, we saw that states with the same n but different / were 
degenerate (had the same energy). In multielectron atoms the situation 
is different. Consider an atom with Z electrons; let us say that Z — 2 
electrons fill a certain number of subshells. We saw previously that a 
filled subshell has spherical symmetry. This means that outside the 
filled subshells, the remaining electrons will see a net charge of 


(+ Zle|) — (Z — 2)le| = 2|e|. This will be true as long as they stay outside 
the space occupied by the inner electrons. We saw, however, that the 
electrons in the outer subshells with small values of | have a small 
chance of being inside the inner subshells, very close to the nucleus. 
This tendency decreases as ! increases. If the electron gets inside, close 
to the nucleus, the nuclear charge will not be shielded by the other 
electrons and the electron will be more tightly bound; that is, its energy 
will be more negative, lower. Because this tendency to get inside is 
greater for the low | states, the energy will be lower for those electrons 
for which / is small. 


With these two guiding principles in mind, let us proceed to determine 
the lowest energy configuration of electrons for the different elements. Con- 
sider first hydrogen, 1H, where the superscript to the left of the chemical 
symbol represents the total number of electrons for that atom. The one elec- 
tron will go to the 1s subshell. (The numeral in front of the letter s indicates 
the value of the quantum number n; for example, the 1s subshell is one for 
which n = 1 and! = 0.) The configuration will be 1s’, where the superscript 
1 on the right side of s indicates the number of electrons in that state. For 
*He, the configuration is 1s*, that is, two electrons in the 1s subshell. Because 
m, can have (21 + 1) values, in the s subshells m, has [(2) (0) + 1] = 1 value, 
this value is m, = 0. By Pauli’s principle, the two electrons in the s subshell 
must have opposite spins. The 1s subshell is full. Because for n = 1, / can 
only be zero, there are no more subshells with n = 1. For the next atoms, 
3Li, ‘Be, . . ., we must begin to put electrons in the subshells with n = 2. 
For n = 2, we have two possible subshells: 2s (J = 0) and 2p (J = 1). As 
indicated herein, the subshell with the smaller / value has lower energy and 
we begin filling the 2s and then the 2p. The electronic configurations for the 
next few atoms are as follows: For °Li the configuration is 1s?2s’; for *Be the 
configuration is 1s?2s?. The 2s is full. For °B, the configuration is 1s*2s*2p’. 
The 2p subshell has room for 5 more electrons. The 2p subshell will be pro- 
gressively filled for the successive elements °C (carbon), ’N (nitrogen), °O 
(oxygen), °F (fluorine) until we get to Ne (neon) whose configuration will 
be 1s72s?2p°. The 2p is full, and there are no more subshells in n = 2. We 
now begin to fill subshells with n = 3, starting with the 3s subshell and then 
the 3p; that is, for ''Na, the electron configuration is 1s72s?2p°3s'. For '*Mg, 
the electron configuration is 1s?2s?2p°3s?. For ‘Al the electron configuration 
is 1572s?2p°3s73p'. . . . . For !8Ar the electron configuration is 1s72s?2p°3s3p°. 
The 3p is full. | | 

We might expect that after the 3p state is full the 3d would begin to fill. 
However, it turns out that the 4s subshell has a lower energy than the 3d 
subshell. Why? There is no simple explanation; it just comes out of the cal- 
culations. We can get a feeling for the reason from the following argument. 
We have seen that the difference in energy in going from a given principal 
quantum number n to the next higher n becomes less pronounced as n in- 
creases. We have also seen that states with low | have lower energy because 
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they have a small probability of being found inside the inner shells, close to 
the nucleus. It is relatively simple to see that the dependence of the total 
energy E on | becomes more important as n increases, and there is an in- 
creasing number of electrons. The reason is that the difference in the charge 
that the electron experiences, when it is close to the nucleus or far from it, 
is greater when there are many electrons than when there are just a few. 
Consider, for example, °Li. The electron in the 2s state sees a net charge of 
only +|e| when it is outside the orbit of the 1s electrons, and +3le| when it 
is inside their orbit, a threefold increase. Now consider !°K, the electron in 
the 4s state sees a net charge of only + |e| when it is outside the orbits of the 
other electrons, but experiences the full charge of the nucleus of + 19|e| when 
it is inside the other orbits, a nineteenfold increase. This argument gives a 
feeling of why the inversion of the energy of shells may take place particularly 
with the more complex wavefunction shapes of the higher energy electrons. 
This inversion of shell energies, that is, 4s states having lower energy than 
3d states, begins to take place with '’K and is predictable from detailed cal- 
culations. Because of this, '’K has the electron configuration 1s*2s?2p°3s73p%4s". 
*°Ca has the configuration 1s*. . . 4s*. The 4s is now full. Next the 3d subshell 
begins to fill. The 3d, with a capacity for 10 electrons is progressively filled 
as we go from 7'Sc (scandium) to *°Zn (zinc). One should notice that the 4s 
and the 3d must be very close in energy, as evidenced by the fact that for Cr 
(chromium) and Cu (copper) the 4s loses one of the electrons to the 3d. 
Inversions similar to that between the 3d and the 4s occur later. With this 
understanding of the configurations of electrons in the ground states of the 
simpler elements, let us now consider the general properties of some of the 
groups of elements in Fig. 21-17. 


Rare Gases 


Atoms in group VIII, He, Ne, Ar, ... are inert gases: They interact very 
poorly with one another or with other atoms; as a result, they do not form 
molecules or solids easily. They are, except under very extreme circumstances 
(low temperature and high pressure), monoatomic gases. Why? 

In these atoms the p subshell (except for He) is filled, and there are no 
additional electrons outside this subshell. Because the electron configuration 
is made up of closed subshells, their spacial distribution (and associated charge) 
has spherical symmetry. Spherically symmetric charge distribution, to an 
external observer (another atom), looks as if it is all concentrated at the center, 
that is, at the nucleus. The net charge of these atoms behaves as a neutral 
point particle at the center of the atom. As a result, these atoms do not have 
a net electric dipole. They do not have a magnetic dipole either. The reason 
is that for each electron with a certain positive m,, there is one with an equal 
negative value of m,, and therefore the net orbital angular momentum L is 
zero. If there is no orbital angular momentum, there can be no magnetic 
dipole moment because yw is proportional to L. Not only is there no orbital 
dipole moment, but there is also no net spin dipole moment. For every electron 


with m, = 3, there is one with m, = —3. In addition, there is a large energy 
difference between the p subshell occupied by the outer electrons and the 
next available subshell, the s subshell. Figure 21-18 shows the energies of the 
shells as a function of Z. It is seen that it takes considerable energy to excite 
(raise to a higher energy level) the electrons of these atoms and thereby destroy 
their spherical electrical symmetry. Such excitation energy is not available at 
ordinary temperatures. 

The standard mechanisms for molecule and crystal formation, which will 
be discussed in the next chapter, are electron exchange and sharing, and 
electric dipole and magnetic dipole interaction. These mechanisms are not 
present when there is spherical electrical symmetry. The electron configu- 
ration of these rare gas atoms is a very stable, low-energy configuration. 


Alkali Elements 


Now that we understand the rare gases, we can also understand many fea- 
tures of the elements in group I called the alkali elements. All these elements 
consist of a rare gas electron configuration plus one extra electron. Because 
the inert configuration is difficult to disturb, the chemical and some physical 
properties are determined by the behavior of this lone electron. For example, 
the optical spectrum is due to excitations of this extra electron. The atom has 
a configuration of electrons very similar to that of hydrogen but with some 
important differences. 

When the extra electron is outside the inert configuration, it sees a spher- 
ical charge distribution that consists of 


+Z |e| — (Z — 1 lel = +lel 


In other words, all the positively charged protons in the nucleus are shielded 


THE PERIODIC TABLE 341 


FIGURE 21-18 

Relative energy of the subshells 
in multielectron atoms as a func- 
tion of the atomic number Z. The 
curve showing the energy of a 
given subshell starts at the value 
of Z for which that particular 
subshell begins to be occupied. 
(Source: Robert Eisberg and Rob- 
ert Resnick, Quantum Physics of 
Atoms, Molecules, Solids, Nuclei, 
and Particles, 2nd ed. Copyright 
© 1985 by John Wiley & Sons, 
Inc. Reprinted by permission of 
John Wiley & Sons, Inc). 
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except one. A net charge of +|e| acts as if it were at the center of the atom. 
This is the situation of the hydrogen atom. We may therefore expect that the 
energy levels for the outer electron should be very similar to those of hydro- 
gen. They are in some cases; those cases for which ! is large. However, they 
are different in cases where ! is small. The reason for the similarities and the 
differences is that electrons in states with a large | tend to stay outside the 
inert configuration, as we have seen previously (see Fig. 21-16). Those elec- 
trons in states with small | spend some time inside the inner shells where 
the positive nuclear charge is not fully shielded. This means that the electron 
in these states will be more tightly bound and consequently will have a lower 
energy, that is, be more negative. 


In Fig. 21-19 we show a few of the experimentally determined energy 


levels of Na (sodium). For purposes of comparison, some of the excited levels 
of H are shown on the left. As we indicated earlier, in the ground state the 
electronic configuration of Na (1s*2s*2p°3s') consists of 10 electrons having 
the same inert configuration as those of Ne (Z = 10) and one electron in the 
3s subshell. As expected from the arguments just mentioned, in Na the other 
subshells with n = 3, that is, the 3p (I = 1) and the 3d (J = 2) are subshells 
of higher energy than the 3s. It is clear from Fig. 21-19 that the 3s state of Na 
is quite different in energy from the 1 = 3 level of hydrogen, approximately 
3.7 eV (remember that for hydrogen, states having the same n but different 
! are degenerate, Fig. 21-2). The difference between the 3p state of Na and 
the n = 3 state of hydrogen is less, ~1.5 eV, and the 3d state of Na is almost 
identical to the n = 3 state of hydrogen. Similar conclusions are drawn if we 
compare the n = 4 states of Na and the n = 4 state of hydrogen. Experiment 
confirms the qualitative arguments concerning the similarities and the dif- 
ferences between hydrogen and the alkali elements. 


FIGURE 21-19 
Schematic comparison of the en- 
ergy levels of hydrogen and so- 
dium atoms. 


Because the outer electron is either fully or partially shielded from the 
full charge of the nucleus by the other electrons, the energy needed to remove 
it from the atom, that is, to ionize the atom, is small. As a result, the alkali 
atoms are very active chemically: They easily lose the outer electron to other 
atoms and can therefore interact electrically with them. Most of these elements 
form ionic compound crystals, or metallic solids. We will discuss this in some 
detail in the next chapter. 


Example 21-3 


Let us assume that Na atoms are hydrogen-like atoms. The electron config- 
uration of Na (Z = 11) is 1s?2s?2p®3s'. (a) What would be the expected energy 
needed to remove the 3s electron? (b) The measured ionization energy of Na 
is 5.14 eV. What is the effective positive charge seen by the 3s electron? 


Solution 


(a) The electron configuration of Na atoms consists of 10 electrons having 
a rare gas configuration plus one electron in the 3s subshell. Because 
the rare gas configuration has spherical symmetry, when the 3s elec- 
tron is outside this configuration it sees a net positive charge of +|e| 
at the center of the atom. As a consequence, the energy spectrum 
of this electron should be the same as that of hydrogen, that is, 


13.56 eV 
sz 


n 


In particular, the ground-state energy, and therefore, the energy 
needed to remove the 3s electron should be 


13.56 eV _ 
a a. 
(b) From Eq. 21.9, E,, « Z?. We can make use of the fact that the ionization 


energy is 5.14 eV, rather than 1.51 eV, to find the effective charge 
seen by the 3s electron. | 


E3 = —- 1.51 eV 


13.56 
E3 = —5.14eV = Dies (- 32 eV) 
Solving for Zesiective, 
5.14eV x 9]"” 
: = | ———qc“« = J. 
Li hectics 13. 5 6 eV 85 


The atoms that readily accept the electron that the alkali elements may easily 
give away are the elements in group VII. The reason is simple; these atoms 
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find themselves in the opposite situation. They have an imperfectly shielded 
nuclear charge; they lack one electron in the p subshell to have an inert low- 
energy configuration. 

When chlorine (Cl) acquires an extra electron to become the negative ion 
Cl”, this electron is bound by 3.62 eV. This means that in the process of 
capturing this electron, 3.62 eV of energy are released, 


Cl +e —~Cl” + 3.62 eV 
On the other hand, to ionize Na, 5.14 eV must be provided 
Na + 5.14 eV — Na*™ + e— 


It would seem that energetically, this cannot occur naturally because an amount 
of energy equal to 5.14 eV — 3.62 eV = 1.52 eV must be provided. However, 
there is an additional coulombic attraction energy to be considered. The bind- 
ing between Na* and Cl” involves a net release of energy, that more than 
compensates for the difference (see Section 22.3a). The consequence is that 
elements in group I combine readily with those in group VII to form very 
stable compounds, for example, NaCl, LiF, NaBr. 


Scandium to Zinc (Sc—Zn) 


A complete discussion of the properties of each group in the periodic table 
of elements is beyond the scope of this book. However, there is one more 
point of pertinent interest. As mentioned in connection with the alkali ele- 
ments, the chemical and physical properties of an atom are determined by 
the outermost electrons (electrons outside the inert configuration). These are 
called the valence electrons. We can see, qualitatively at least, that in going 
from left to right in a period, the chemical properties change more or less 
progressively because the number of valence electrons increases. This, in 
physical terms, means that the net electric and magnetic dipole moments 
change, which in turn changes the interactions with other atoms. This pro- 
gressive change in the chemical properties breaks down at the central block. 
In fact, the chemical properties of the atoms from Sc through Zn are very 
similar, and it is not difficult to see why. What happens in going from Sc to 
Zn is the addition of one more electron in the 3d subshell for each element 
in this progression. The 3d is being filled while the 4s is already full. The 
reason for the inversion, given earlier, is that the 4s state is lower in energy 
than the 3d state. However, the average radius of the 4s is greater than that 
of the 3d. As a consequence, the 4s electrons shield the 3d electrons from 
external influences. Thus, adding one more electron to the 3d subshell inside 
the filled 4s subshell does not appreciably change the interaction between 
different atoms. It is not the 3d electrons that participate in the bonding but 
rather the 4s electrons. The 3d electrons remain localized, that is, attached to 
the original atom. | 


PROBLEMS 


21.1 The differential equation for ®(¢), Eq. 21.4, 
can be rewritten as 


0 
dd’ 
In this form, the equation is similar to the differential 
equation for the particle in the infinite potential well, 
Eq. 20.19. Assume a solution of the form ® = e%” 


and show by direct substitution that such a function 
is a solution provided a = +im;, where = V —1. 


+m? ® = 0 


21.2 In Fig. 21-1 it is seen that a point P with co- 
ordinates r, 6, and @ is the same point as a point 
with coordinates r, 0, and @ + 27. Because the ei- 
genfunctions must be single-valued ®(¢) = O(¢ + 
277). Use the solution found in Problem 21.1 to show 
that this requirement limits the values that m, can 
take. That is, show that m, = 0, +1, +2, +3,.... 


21.3. In the Bohr model of the hydrogen atom, the 
electron is assumed to move in circular orbits around 
the proton, that is, the motion takes place in a plane 
that we can call the x-y plane. Use the uncertainty 
principle in the z direction, that is, Ap, Az = f and 
the fact that p2 = (Ap,)? (see Problems 19.21 and 
19.22) to show that the motion of the electron cannot 
be planar motion. 


21.4 Singly ionized helium behaves as a hydrogen 
atom but with twice the nuclear charge of the hy- 
drogen atom. What is the ground-state energy of the 
remaining electron in singly ionized helium? 
(Answer: 54.24 eV.) 


21.5 (a) Use Eq. 21.9 to compare the spectrum of 
energies of singly ionized helium with the spectrum 
of hydrogen. (b) Draw an energy diagram showing 
the first few energy levels for hydrogen and for 
singly ionized helium. (c) What is the wavelength 
of the photon emitted when the remaining electron 
in singly ionized helium makes a transition from 
n=2ton = 1? 


21.6 (a) How many atomic states are there in hy- 
drogen with n = 3? (b) How are they distributed 
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among the subshells? Label each state with the ap- 
propriate set of quantum numbers n, I, m, m,. (c) 
Show that the number of states in a shell, that is, 
states having the same n, is given by 2n?. (Hint: 
14+24+3+...+n= n(n + 17/2.) 


21.7. Ahydrogen atom is ina state with] = 3. What 
are the allowed values of L, and ;,? 

(Answer: 0, +h, 2h, +3h; 0, xeh/2m, t+eh/m, 

+ 3eh/2m.) 


21.8 A ball with a mass of 100 g rotates in a circle 
of radius 1 m with a speed of 4 m/sec. How many 
possible orientations can the angular momentum of 
the ball take? 

(Answer: 4 x 10°°.) 


21.9 Draw an energy-level diagram and calculate 
the separation between adjacent normal Zeeman en- 
ergy levels for the 3d state of an atom placed in a 
magnetic field of 1.5 T. 


21.10 A certain transition in an atom of wavelength 
A = 4226 A, when observed in a magnetic field of 
1 T, exhibits a normal Zeeman pattern whose com- 
ponents are separated by 1.4 x 10'° Hz. Calculate 
the ratio of the charge to the mass of the electron 
from these data. 


21.11 A spectrometer is capable of resolving spec- 
tral lines separated by 0.1 A when the wavelength 
is 5000 A. What is the smallest magnetic field needed 
to observe the normal Zeeman effect in such a case? 

(Answer: 0.86 T.) 


21.12 A beam of hydrogen atoms emerges from an 
oven with a velocity v = 10* m/sec. The beam of 
atoms passes through a region 5 cm long with a 
magnetic field gradient dB/dz = 10° T/m. The direc- 
tion of the field gradient is perpendicular to the in- 
cident velocity. The atoms then continue through a 
field-free region for another 20 cm before being de- 
posited on the screen. What is the separation be- 
tween the two lines on the screen? 

(Answer: 12.5 cm.) 
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21.13 Electron spin resonance refers to the absorp- 
tion of electromagnetic radiation by the electrons in 
atoms as they make transitions from the state where 
their spins are parallel to an external magnetic field 
to the state where the spins are antiparallel. At what 
frequency v would spin resonance occur with hy- 
drogen atoms in a magnetic field B = 1.5 T? 
(Answer: 4.2 X 10!° Hz.) 


21.14 The radial probability function for the ground 
state of the hydrogen atom is P(r) = A r? e~7', 
where A is a constant and fg is the radius of the 
smallest orbit in the Bohr model. For what value of 
ris P(r) a maximum? 

(Answer: fo.) 


21.15 (a) Calculate the ground-state energy of a 
system of 10 electrons in a one-dimensional infinite 
potential well of width a = 1 A? (b) What is the 
average energy per electron? 
(Answer: (a) 4.1 x 10° eV, 
(b) 4.1 x 10? eV/electron.) 


21.16 Repeat Problem 21.15 for a system of 10 elec- 
trons in a two-dimensional well of width 1 A and 
length 1 A. See Problem 20.19. 
(Answer: (a) 2.2 x 10° eV, 
(b) 2.2 x 10° eV/electron.) 


21.17 Suppose there were atoms having electrons 
with the principal quantum number up to and in- 
cluding n = 7. What would be the maximum number 
of elements? 

(Answer: 280.) 


21.18 What is the electronic configuration of Al 
(Z = 13), Si(Z = 14), P(Z = 15), Ca (Z = 20), and 
Br (Z = 35)? 


21.19 The electron configuration of sulfur (16 elec- 

trons) is 1s72s*2p°3s?3p*. Write a complete set of 
quantum numbers for the four electrons in the 3p 
subshell. 


21.20 No two electrons in a subshell pair up with 
opposite spins until there is at least one electron in 
each state (each value of m);) of the subshell. This is 
known in spectroscopy as Hund’s rule. In view of 
this, what combinations of four among the set of 
quantum numbers found in Problem 21.19 are ac- 
ceptable for the four electrons in the 3p subshell of 
the sulfur atom? 


21.21 (a) What is the electron configuration of Li 
(Z = 3)? (b) Assuming that Li is a hydrogen-like 
atom, calculate the ionization energy of the 2s val- 
ence electron. (c) The experimentally determined 
ionization energy of Li is 5.39 eV; what is the effec- 
tive positive charge seen by the 2s electron? 


21.22 Potassium belongs to the alkali elements group 
with one valence electron in the 4s subshell. The 
ionization energy of potassium is 4.34 eV. What is 
the effective charge seen by the 4s valence electron? 

(Answer: 2.26 |e|.) 


Crystal Structures 
and Bonding in Solids 
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22.1 INTRODUCTION 


Matter comes in three forms or phases: gases, liquids, and solids; the last two 
are called condensed systems. Semiconductors are crystalline solids. To under- 
stand their electrical properties we need a basic knowledge of their crystal 
structure. 


Gases 


In a molecular gas the average distance between the molecules is large com- 
pared with their size. As a result, the intermolecular forces (forces between 
molecules) are small compared with the forces that bind the molecules to- 
gether. Because of this, the relative positions of the molecules are completely 
random. | 


Liquids and Solids 


The atoms and molecules in condensed systems are close together; the forces 
between molecules are comparable to the forces within the molecules. In a 
liquid the molecules form temporary short-range order arrangements that are 
continuously broken by the high thermal energy of the molecules. When a 
liquid is slowly cooled, the molecules will arrange themselves in a crystalline 
array that produces the maximum number of bonds and thereby leads to a 
state of minimum energy. If the liquid is cooled too fast, the internal energy 
will be removed before the molecules have a chance to arrange themselves. 
As a result, a solid is formed that can be considered a “snapshot” of the 
liquid: We have an amorphous solid that displays short-range order but not 
long-range order. Short-range order usually means a few atoms and long- 
range order thousands of atoms. Amorphous solids are called glasses or glassy 
solids. Because of the lack of long-range order in glasses, the bonds between 
different atoms or molecules vary in strength. When such a solid is heated, 
the weaker bonds break first, leading to a gradual softening before a complete 
meltdown occurs. Because of this, amorphous materials lack a well-defined 
melting point. 

We will be dealing primarily with crystalline solids and, although a de- 
tailed knowledge of the crystal structure is not necessary to obtain a semi- 
quantitative understanding of electrical conduction processes, we will discuss 
briefly a few facts about crystal structures so that the structure of the semi- 
conductor silicon can be understood. 


22.2 CRYSTAL STRUCTURES 


A crystalline solid is a three-dimensional, periodic array of atoms or molecules 
called a crystal structure. The most convincing evidence concerning the regular 
arrangement of atoms in a solid comes from X-ray diffraction (Section 12.7). 
The diffraction pattern not only confirms the periodic arrangement, but also 
has allowed crystallographers to determine the arrangement. 
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FIGURE 22-1 


Specification of a crystal struc- 
ture. (a) Regular periodic ar- 
rangement of mathematical 
points in space called the space 
lattice. (b) Basis: atom or group of 
atoms. (c) Basis placed at or near 
each lattice point yields a crystal 
structure. 


One might guess from seeing snowflakes that there is an infinite number 
of possible lattice arrangements, but such is not the case. It turns out that all 
crystal structures can be included in one of fourteen lattice arrangements. 

A crystal structure can be specified by a periodic space lattice and an atom 
or group of atoms placed at or around each lattice point. The atom or group 
of atoms constitutes the basis (see Fig. 22-1b). The space lattice is a regular 
periodic arrangement of points in space and is purely a mathematical abstrac- 
tion (see Fig. 22-1a). To obtain a crystal structure, we must place at or around 
each lattice point a basis of atoms, Fig. 22-1c. This group of atoms must be 
identical in composition, arrangement, and orientation. 

One important fact about a space lattice that can be readily seen from 
Fig. 22-1a is that every lattice point has identical surroundings. The grouping 
of lattice points about any given lattice point is the same as the grouping 
about any other point. It turns out that geometrically there are only 14 unique 
ways in which this can be done: This gives rise to the 14 Bravais lattices shown 
in Fig. 22-2, named after the discoverer. Any lattice can be constructed by 
putting side by side one of these 14 Bravais lattices. These lattices are all 
parallelepipeds that differ from one another in the relative sizes of the three 
basic sides and/or the angles between them. 

Let us consider some crystal structures. 
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FIGURE 22-2 
The 14 Bravais space lattices. 
(continued on page 350) 
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| FIGURE 22-2 
Trigonal Hexagonal (Continued) 
NaCl Structure 
This is a face-centered cubic (fcc) Bravais lattice (Fig. 22-3a). The basis consists 
of a Na atom and a Cl atom separated by one-half the body diagonal, Fig. 
22-3b. Other materials having the same structure include KBr, KCI, AgBr, 
MgO, MnO, and PbS. 
FIGURE 22-3 


(a) Face-centered crystal struc- 
ture of sodium chloride (NaCl). 
(b) Basis of the NaCl crystal 
(a) (b) structure. 
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CsCl Structure 


This is a simple cubic Bravais lattice (Fig. 22-4a). The basis consists of a Cl 
atom at the corner and a Cs atom separated by one-half the body diagonal, 
Fig. 22-4b. Other materials having this structure include AgMg, AINi, CuZn 
(brass), and BeCu. 
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‘4 oe : 2 (a) Cubic crystal structure of ce- 
: , ee | sium chloride (CsCl). (b) Basis of 
= “S7 ei the CsCl crystal structure. 
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Diamond Structure 


It is a face-centered cubic (Fig. 22-52). The basis consists of two identical atoms, 
one is situated at the corner of the cube and the other is displaced by one- 
quarter the body diagonal along that diagonal (see Fig. 22-5b). The diamond 
structure can be thought of as being made of two interlocked fcc structures, 
each with one atom per lattice point. These two fcc structures are displaced 
from each other by 4d along the body diagonal. 

The structure of diamond gives rise to a tetrahedral bond arrangement 
where each atom can be considered to be at the center of a tetrahedron forming 
one bond with each of its four nearest neighbors. These appear to be located 
at the four corners of the tetrahedron (see Fig. 22-6). Materials having this 
type of structure include C (carbon in the diamond structure), 5i (silicon), Ge 
(germanium), and Sn (tin). 
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FIGURE 22-6 
Tetrahedral arrangement of at- 


oms in diamond resulting from 
FIGURE 22-5 the crystal structure of Fig. 22- 
(a) Face-centered crystal structure of diamond. (b) Basis of the diamond crystal struc- 5a. Each carbon atom in the dia- 
ture. The diamond structure can be thought of as two interlocked face-centered struc- mond structure is at the center 
tures displaced from each other along the body diagonal by one quarter of the diagonal of a tetrahedron with its four 
of the unit cube. For the sake of clarity in the drawing, only some of the atoms in the nearest neighbors located at the 
unit cube of the structure have been drawn in the figure. corners of the tetrahedron. 
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22.3 CRYSTAL BONDING 


As mentioned before, the exact knowledge of the crystal structure is not 
necessary in order to obtain a semiquantitative understanding of conduction 
processes in solids. However, the scientist who wants to make exact calcu- 
lations of some property of a particular crystal must have a detailed knowledge 
of the crystal structure of that material. For our purposes it is more important 
to achieve an understanding of the different mechanisms that hold the struc- 
ture together, that is, the different types of bonding. There are four general 
types of bonding that we discuss next. The actual bonding is usually a mixture 
of two or three. | 


22.3a lonic Crystals (lonic Bonding) 


Ionic crystals consist of positive and negative ions in a periodic array, for 
example, Figs. 22-3 and 22-4. This is a result of an atom losing one or more 
electrons and another atom capturing them. For this to occur, the exchange 
must be energetically favorable. This means that if one starts with two neutral 
atoms, say Na and Cl, and goes through the process of ionizing one of them, 
then giving the resulting electron or electrons to the other and bringing the 
two ions together, the overall energy of the bound pair must be less than the 
energy of the initial state of two neutral atoms. As we have discussed in 
Chapter 21 in connection with the periodic table, this situation occurs (al- 
though not exclusively) with the alkali elements that have a weakly bound 
s electron and the halogen elements that need one electron in order to have 
an inert configuration. To ionize Na, an energy of 5.14 eV must be provided, 
which may be written as 


Na + 5.14 eV — Na* + e7 


On the other hand, when Cl captures one electron, this electron will become 
bound, and the energy released will be 3.62 eV. We may write the reaction 
as 


Cl+e7> —Cl + 3.62 eV 


If now we bring Na* and Cl” together until the separation between them is 
2.51 A (this is the equilibrium separation between the centers of the two ions 
in the NaCl molecule), the Coulomb attraction potential energy E,, will be (see 
Eq. 14.9) 

1 2 242 (1.6 x 10°" Cy’ 


ES oes Oe 
0 Ane, ms 951 x 10-2 m 


~9.18 x 10°" J = —5.73 eV 


This represents a decrease of energy from the situation where the two ions 
are infinitely far apart; that is, this represents a release of energy. The net 
energy released is 


E = 3.62 eV + 5.73 eV — 5.14 eV = 4.22 eV 


If E = Ois the reference energy state when the Na and Cl atoms are far apart, 
when they are ions at r = 2.51 A their energy is —4.22 eV. In other words, 
they fall into a potential well of depth — 4.22 eV. This is the energy needed 
to break apart the NaCl molecule and restore each ion to electrical neutrality. 
In an ionic crystal, the binding energy is the sum of all the attractive 
forces (between all ions of opposite sign) and all the repulsive forces (between 
ions of like sign). For an ionic crystal this energy can be written as 
2 
ee (22.1) 


A47TEo r 


where a is known as the Madelung’s constant and r is the distance between 
two nearest ions. The Madelung constant is different for different crystal 
structures. For the fcc NaCl structure, a = 1.7476. Let us look at this particular 
structure. The Nat ion has six nearest neighbors (Fig. 22-3a) at a distance r 
(six Cl~ ions). Therefore the energy contribution is 

1 e 


= 22.2 
A7TEo r ( ) 


be =-=6 


The next nearest neighbors are 12 Na* at a distance (2)\?r (Fig. 22-3a) and 
contribute a repulsion energy of 


Bie = 12 = e (22.3) 
— ATE, 2i2y ° 
At the next distance there are eight Cl~ at (3)”r 
1 @ 
= = ap, 204 
a A4TTEp 3127 ( ) 
Then there are six Na” at 2r 
1 @ 
Es = +6 =a 22.5 
: A47TE 2r ( ) 


and so on. The total binding energy obtained by combining all these contri- 
butions, Eqs. 22.2, 22.3, 22.4, 22.5, ...,1S 


- 1 ef, 2,8 8... 
Total Ar€éy V/2 V3 2 


and the summation of the series was evaluated by Madelung for fcc crystals 
as 1.7476. The total binding energy is then 


Z 


é 
= 22: 
A47TrE r ( ?) 


Etotal = —] .7476 


In arriving at Eq. 22.6, we have considered a Na* ion and have calculated 
the electrostatic potential energy due to the other ions. We have assigned all 
this energy to the Na* ion. Actually, when we consider two charges q; and q2 
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separated by a distance r and write that the electrostatic potential energy is 


1 192 


Amey 1 


this energy is the energy of both charges. The electrostatic potential energy 
associated with one charge is half that amount. Thus, in the present case, 
the binding energy of the Na* ion under consideration is half the amount 
given by Eq. 22.6. When the values for e and r (2.8 A for crystalline NaCl) 
are substituted into the Eq. 22.6 and the division by two is performed, we 
obtain a binding energy per ion as ~ 4 eV/ion. This is a rather strong bond. 
To get an idea, consider that the thermal energy at room temperature kgT is 
0.025 eV. This is the reason why ionic crystals are extremely hard and have 
a high melting point (801° C for NaCl, 845° C for LiF as examples). We should 
also note that all the electrons are bound to the ions, which, with the electron 
exchange, now have completely filled electron subshells. Because there are 
no free or loosely bound electrons to transport charge when an electric field 
is applied across the crystal, simple ionic crystals do not conduct electricity 
and belong to the class of insulators. 


22.3b Covalent Bond 


Another important and very strong type of bond is the covalent bond. A good 
example of this is the H, molecule. The bond comes about because both 
electrons are in orbit (so to speak) around both nuclei, Fig. 22-7. Because 
electrons repel each other, the largest average distance apart can be main- 
tained when their motion is coupled so that at a given time each is on the 
same side of each nucleus. Therefore, one or the other of the electrons spends 
part of its time between the two nuclei, attracting both of them and thus 
binding them. One way to understand why this takes place is by considering 
the tendency of the atoms to have filled subshells. The hydrogen atom needs 
one more electron to fill the 1s subshell and thus achieve the inert electronic 
configuration of helium. Lacking any other source of electrons, it will try to 
capture the extra electron from the adjacent hydrogen atom. Of course, the 
other atom will try to do the same, and they compromise by sharing both 
electrons. 

This can be put into a more rigorous quantum mechanical language. 
Consider two identical H atoms, each with its electron in the 1s state. When 
the two atoms are far apart the wavefunctions do not overlap, and therefore 
the electrons remain with their parent atoms. The situation is shown in Fig. 
22-8, which represents the radial dependence of the 1s wavefunction for 
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FIGURE 22-7 


Schematic representation of the 
electrons of two bound hydro- 
gen atoms orbiting around the 
nuclei and in the process creat- 
ing a covalent bond between the 
two atoms. 


FIGURE 22-8 
Radial part of the ground-state 


eigenfunctions of two isolated 
hydrogen atoms as a function of 
the distance of the electrons from 
their respective nuclei. 


hydrogen atoms numbered 1 and 2. Note that each wavefunction is radially 
symmetric about a nucleus. If, however, the two atoms are brought close 
together until the nuclear separation is ~ 0.7 A (the equilibrium separation 
between the two hydrogen atoms in the H2 molecule), the two wavefunctions 
overlap significantly, see Fig. 22-9. Because the electron from one atom can 
be found close to the nucleus of the other, it can be captured by the latter, 
and as a result, for a certain fraction of its time a given electron will be found 
with each proton. Because the two electrons are indistinguishable, it can be 
said that they spend equal time with each proton. The wavefunction asso- 
ciated with the electrons must reflect this fact. There are two possibilities, 
which will be discussed in detail in Chapter 24 (Section 24.4). The one that 
leads to the covalent bond is the sum of the two wavefunctions y = x; + Y2, 
shown in Fig. 22-10. This wavefunction satisfies the condition that each elec- 
tron can be found equally with either of the two protons because it is sym- 
metric about the midpoint between the two protons. But it also has the in- 
teresting feature that electrons represented by such a wavefunction spend a 
considerable amount of time between the two protons. When that happens, 
both protons are attracted toward the center by the electron and thus toward 
each other, thereby producing the bonding. In a single covalent bond each 
atom contributes one electron to the bond. 

The best examples of covalent bonding in solids are offered by carbon in 
the diamond structure, silicon, and germanium. These elements belong to 
group IV of the periodic table (see Fig. 21-17). The outermost electron con- 
figuration consists of a filled s subshell followed by a p subshell with two 
electrons (C: 1s?2s?2p?; Si: 15?2s*2p°3s*3p"; Ge: 1s°2s?2p°3s*3p°3d'°4s*4p’). To 
have a full p subshell, each needs .four more electrons. They achieve this by 
sharing four electrons with four other atoms. Each atom provides two elec- 
trons from the outer p subshell and two from the s subshell in order to form 
four covalent bonds. Even though the two electrons in the s subshell form a 
closed subshell, they participate in the bonding because they are very close 
in energy to the next p subshell (see Fig. 21-18). The most stable arrangement 
of the four bonds occurs when the atoms are symmetrically arranged in space. 
This symmetry is achieved by having a tetrahedral structure. Each atom is at 
the center of a tetrahedron and makes a covalent bond with its four nearest 
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FIGURE 22-9 

Overlap of the eigenfunctions of 
two individual atoms when the 
atoms are brought close together 
to form a molecule. 


FIGURE 22-10 


Quantum mechanical basis of the 
covalent bond. The eigenfunc- 
tion y = x1 + x2 associated with 
an electron that can be found with 
equal probability with either of 
the two nuclei. Note that be- 
cause the value of y is large in 
the region between the two nu- 
clei, the electron represented by 
such an eigenfunction spends 
considerable amount of time be- 
tween the nuclei and in so doing 
binds them together. 
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neighbors, which are located at the corners of the tetrahedron (see Fig. 22- 
6). Each corner atom, in turn, finds itself at the center of another tetrahedron, 
so that the structure consists of a system of interlocked tetrahedra. The struc- 
ture is in fact two interpenetrating face-centered cubic lattices that are dis- 
placed along the diagonal by one quarter of the diagonal of the cube (Fig. 
22-52). 

The covalent bond is very strong, for example, 7.4 eV for diamond, 12.3 
eV for SiC (carborandum). Moreover, because all available valence electrons 
pair up and orbit around pairs of atoms, it is difficult to dislodge them to 
conduct electricity and therefore covalently bonded solids belong to the class 
of insulators or semiconductors. 


22.3C Metallic Bond 


The metallic bond can be thought of as the limiting case of the covalent bond 
in which electrons are shared by all the ions in the crystal. 

When a crystal is formed of atoms that have a few weakly bound electrons 
in the outer subshells, these electrons become free from the individual atoms, 
which thereupon acquire stable closed subshell configurations. The energy 
needed to liberate these loosely bound electrons is more than compensated 
for by the decrease in energy resulting from the binding. 

We can visualize the metallic bonding as follows. The liberated electrons 
move through the entire crystal, visiting each positive ion at some time or 
other. We thus have a situation where an array of heavy positive ions is 
permeated by a “sea” of highly mobile negative electrons (see Fig. 22-11). It 
is the electrostatic attraction between the electron sea and the positive ions 
that prevents the breakdown of the entire structure, which would result from 
the repulsion of the positive ions. The electrons provide the “glue” that keeps 
the structure together. 

The metallic bond is in many ways similar to the ionic bond in the sense 
that the main role is played by the electrostatic attraction between unlike 
charges; however, there is a big difference. Whereas in the ionic crystal the 
position of the positive and negative charges and therefore the directionality 
of the forces is fixed, in the metallic bond they are not. The electrostatic 
attraction comes from all directions. This is important, because it explains 
why a small deformation in a nearly perfect metal crystal does not cause a 
fracture. Whether we compress, twist, or pull a piece of metal, the cohesive 
forces are still there and coming from all directions. Pure metals are ductile 
and malleable. The presence of highly mobile charges will be used in Chapter 
23 to explain the excellent thermal and electrical properties of metals. 


22.3d Molecular or van der Waals Bond 


We have seen that when the outer subshells of an atom are not filled, the 
atom tries to fill them by associating with an atom that can supply one or 
more electrons. The two atoms then become bonded in the process. What 
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Nature of the metallic bond. As 
the free electrons in the metal 
move through the solid they at- 
tract the positive lattice ions to- 
ward them and thus prevent the 
breakdown of the structure. 


happens if the atom has an outer subshell that is completely filled and difficult 
to excite, such as is the case with the rare gases? The answer is that it is very 
difficult for them to bond and therefore to form molecules and solids. How- 
ever, they (except He) can form solids at very low temperatures by another 
mechanism. The mechanism is generally known as dipole-dipole interaction, 
and the resulting forces are called van der Waals or London forces. 

Before we consider this effect in the rare gases, let us examine the case 
of the dipole bonding of water, H,O. There are molecules (polar molecules) 
that possess permanent electric dipole moments. An example is HO, in which 
the molecule is formed by covalent bonding. The two 1s electrons from the 
H atoms form covalent bonds with the 2p electrons from the O atom. The 
result is that the two H atoms as well as the O atom will complete their 
subshells, a situation that yields a stable low-energy configuration. The re- 
sulting electron configuration, however, is not symmetric. The electron con- 
centration around the oxygen atom makes that end of the molecule more 
negative than the ends where the hydrogen atoms are. The partially positive 
hydrogen atoms repel one another, which contributes to the angle of their 
bonds (see Fig. 22-12a). The net result is that the molecule has a permanent 
dipole moment, Fig. 22-12b. Such molecules tend to align themselves in such 
a way that the ends of opposite charge are adjacent and thus attract each 
other, forming chains and two- or three-dimensional structures, as indicated 
in Fig. 22-13. A polar molecule is also able to attract molecules that do not 
have a permanent dipole moment. The electric field of the polar molecule 
causes a separation of charge in the other molecule; that is, it induces a dipole 
moment in the same direction as its own, as shown in Fig. 22-14. The result 
is an attractive force. (This is analogous to what happens to an unmagnetized 
piece of iron in the presence of a magnet.) Two molecules or atoms with no 
permanent dipole moment, such as the rare gases, can still attract each other. 
Even though the electron distribution is symmetric on the average, the electrons 
themselves are in constant motion, and at any given instant, one part of the 
molecule can be more negative than another. In the polar molecule, the charge 
asymmetry is fixed, whereas in the nonpolar molecule there is a constantly 
shifting charge asymmetry. When two such nonpolar molecules are close 
together, their fluctuating charge distribution tends to couple and move to- 
gether so that adjacent ends always have opposite signs, thus mutually in- 
ducing dipoles with a resulting attraction. 

We may examine the effect of the moving charges in atoms in much the 
same way as we did in considering the covalent bond. When two atoms are 
brought close enough so that the electrons of one atom experience coulombic 
forces from the electrons of the other atom, and vice versa, they seek the 
lowest energy configuration. If the electrons circled the nuclei at random, 
then at times two electrons would be on the sides of the nuclei adjacent to 
each other, even though at other times they would be on opposite sides of 
their nuclei at the farthest distance from each other (see Fig. 22-15a and Db). 
It was shown quantum mechanically that the average electron energy of the 
a-b type shown in Fig. 22-15a and 22-15b is higher than that of the coupled 
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FIGURE 22-12 
(a) Covalent bond of the oxygen 


and hydrogen atoms in the water 
molecule (HO). (b) Net electric 
dipole moment of the H2O mol- 
ecule. 
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FIGURE 22-13 
Chain of molecules with per- 
manent electric dipole moment 
created when the sides of op- 
posite charge of the molecules 
align themselves. 


a> 


FIGURE 22-14 
A molecule with permanent elec- 


tric dipole moment (a polar mol- 
ecule) induces a dipole moment 
in a nonpolar molecule when the 
two molecules are close to- 
gether. 
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(a) and (b) Uncoupled motion of electrons in two individual atoms resulting in a 
situation where at times the electrons from the two atoms are on adjacent sides of the 
nuclei and at other times they are on opposite sides of the nuclei. (c) and (d) Coupled 
motion of electrons in two atoms as a result of which the electrons are never close 
together. This coupled motion gives rise to attractive forces between the two atoms, 
as can be seen from Fig. 22-15c and d. These forces are known as London or van der 
Waal forces. 
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types, Fig. 21-15c and d, in which the electrons “see” each other and stay as 
far away as possible. This reduction of energy by the coupling of electrons 
is called the London force after Fritz London, who first calculated it in 1927. 
This electron coupling occurs for all atoms, and this low energy attraction is 
also called van der Waal’s forces after the chemist who analysed the conden- 
sation of gases. Although London forces exist between all atoms and mole- 
cules, they can be observed in their isolated state only in the rare gases because 
other atoms and molecules have additional forces, discussed earlier, that mask 
the weaker London forces. It is these forces that cause rare gases to condense, 
but only at very low temperatures because the resulting binding energy of 
the atoms is smaller than the thermal energy at higher temperatures. All the 
rare gases condense, and at sufficiently low temperatures they will solidify 
into crystalline solids, except for helium. Helium will solidify only under high 
pressure and low temperature. 


PROBLEMS 


22.1 Copper has a face-centered cubic structure with 
a one-atom basis. The density of copper is 8.96 g/cm 
and its atomic weight is 63.5 g/mole. What is the 
length of the unit cube of the structure? 

(Answer: 3.61 A.) 


22.2 Sodium has a body-centered cubic structure 
with a one-atom basis. The density and the atomic 
weight of sodium are 0.971 g/cm? and 23 g/mole, 
respectively. What is the length of the unit cube of 


the structure? 
(Answer: 4.29 A.) 


22.3 Assuming that atoms in a crystal structure are 
arranged as close-packed spheres, what is the ratio 


of the volume of the atoms to the volume available 
for the simple cubic structure? Assume a one-atom 


basis. 
(Answer: 0.52.) 


22.4 Repeat Problem 22.3 for the body-centered cu- 


bic structure. 
(Answer: 0.68.) 


22.5 Repeat Problem 22.3 for the face-centered cu- 
bic structure. 
(Answer: 0.74.) 


22.6 A given ionic crystal has the same structure 
as NaCl. The separation between adjacent ions is 3.2 


A. What is the binding energy of each ion, and would 
you expect it to have a higher or lower melting point 
than NaCl? 

(Answer: 3.93 eV.) 


22.7 As we saw in Section 22-3a, the dissociation 
energy (energy to break up the molecule) of the NaCl 
molecule is 4.22 eV. Many chemical handbooks list 
the dissociation energy in kcal/mole. What is the 
dissociation energy of NaCl in kcal/mole? 

(Answer: 97 kcal/mole.) 


22.8 The equilibrium separation between the cen- 
ters of the two ions in a HCI molecule is 1.27 A. The 
ionization energy for the hydrogen atom is 13.56 eV. 
The electron affinity of Cl (that is, the energy re- 
leased when the Cl atom captures an electron) is 
3.62 eV. What is the dissociation energy (the energy 
needed to break up the molecule) for the HCl mol- 
ecule? 

(Answer: 1.40 eV.) 


22.9 The dissociation energy of the KF molecule is 
5.12 eV. The ionization energy for K is 4.34 eV, and 
the electron affinity of F is 4.07 eV. What is the 
equilibrium separation constant for the KF molecule? 


22.10 Consider a one-dimensional lattice consist- 
ing of alternating positive and negative ions. Show 
that the Madelung constant is 1.386. (Hint: In (1 + 
x)= x — 7/2 + 3 — x44 +...) 

22.11 What type of bonding mechanism can we 
expect for (a) the KCl molecule, (b) the CH, (meth- 


ane) molecule, (c) the CO, molecule, (d) the Al atoms 
in a solid, and (e) the Oz molecule? 


22.12 We indicated in Section 22.3a that in the NaCl 
molecule the bonding is ionic. Actually, the wave- 
function of the 3s electron that is transferred from 


PROBLEMS 359 


Na to Cl, although small, is not zero in the space 
between the ions. As a result, the 3s electron spends 
some time between the two atoms (is shared by both 
atoms). Thus, the bonding, although mostly ionic, 
is to a small degree covalent. Consider the NaCl 
molecule as an electric dipole, that is, two equal 
charges, one positive, one negative (q = 1.6 x 
10-19 C) separated by distance d = 2.4 A (the equi- 
librium separation of the ions in the molecule). (a) 
What is the electric dipole moment, uw, = qd, of the 
molecule? (b) The measured electric dipole moment 
is 3.00 x 10-*? C-m, to what degree is the bond 
ionic; that is, what is the ratio of the actual dipole 
moment to the theoretical one? 

(Answer: (a) 3.84 x 10~*? C-m, (b) 0.78.) 


22.13 Barium has two valence electrons, whereas 
oxygen needs two electrons to achieve an inert elec- 
tron configuration. One would expect that in the 
BaO molecule the binding would be ionic, the bar- 
ium atom giving two electrons to the oxygen atom. 
The equilibrium separation between the ions in the 
BaO molecule is 1.94 A, and the measured electric 
dipole moment of the molecule is 2.65 x 10~*? C-m. 
To what degree is the bond ionic? (See Problem 22.12.) 


99.14 In Section 22-3b we indicated that one of the 
possible eigenfunctions for the electrons in the H, 
molecule is y = y; + X2, where y; and x2 are the 
ground-state eigenfunctions of the individual atoms 
(see Fig. 22-10). In the ground state of the H2 mol- 
ecule, both electrons, with opposite spins, have this 
particular eigenfunction associated with them. The 
ionization energy (energy to remove one electron) 
of H> is 15.7 eV, whereas for H it is 13.6 eV. Explain 
why the ionization energy is greater for the hydro- 
gen molecule than for the hydrogen atom. 


Free Electron Theories 
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23.1 INTRODUCTION 


If a battery is connected in series with an ammeter and a piece of some material 
(copper, silicon, glass, or such), a deflection in the ammeter will indicate a 
flow of charged particles (a current 1) through the circuit (see Fig. 23-1). 
Simultaneously there will be a voltage drop V across the sample as measured 
by a voltmeter. If the voltage of the battery is varied, 1 will vary proportionally 
with V. It can therefore be stated that if a voltage V is applied across a sample, 
a current proportional to it will flow: 


V=Ri (23.1) 


where R is the proportionality constant and is called the resistance of the 
sample. Equation 23.1 is known as Ohm’s law and was introduced in 
Chapter 15. , 

Ohm/’s law can be restated in a form more appropriate to the understand- 
ing of the phenomenon of conduction by concentrating attention on the sam- 
ple. The fact that there is a potential difference V across the sample means 
that there is an electric field € in the sample. If the sample is uniform in 
geometry and quality, will be constant, and it follows that 


Vab 


| 


b 
v= | €dx = €a 


V = €d (23.2) 


where d is the length of the sample. 

Given a certain potential difference (and therefore a certain ©), the larger 
the cross-sectional area A of the sample (Fig. 23-2), the larger the current will 
be. (Think of the analogy of a pipe with flowing water.) We can eliminate 
the geometric parameter A by introducing a new quantity, the current density 
J, defined as the current per unit cross-sectional area (see Section 15.3). 

J= or 1=JA (23.3) 
Substituting Eq. 23.2 for V and Eq. 23.3 for i into Ohm’s law, Eq. 23.1, results 
in the relation 


éd 


A 
RAJ or €=R—J 


and 
€E=pl (23.4) 


where the quantity p = R A/d is called the electrical resistivity and has dimen- 
sions of ohm-meters (Q-m). Ohm’s law can be expressed another way by 
introducing the term electrical conductivity, @ = 1/p, which has dimensions of 
reciprocal ohm-meters (Q1-m)~*. Equation 23.4 can then be expressed as 


J=a@ (23.5) 


FIGURE 23-1 


FIGURE 23-2 


CLASSICAL FREE ELECTRON (CFE) MODEL 


Equations 23.4 and 23.5 are two equivalent forms of Ohm’s law. Electrical 
resistivity p and electrical conductivity o are both quantities characteristic of 
the material and independent of the geometry of the sample. 

One of the reasons that the electrical properties of solids are so interesting 
and have drawn so much attention can be found by looking at Table 23-1, 
which lists the electrical conductivity of example materials at room temper- 
ature. The first and obvious thing one notices is the tremendous range of 
values, about 27 orders of magnitude (powers of 10). Any satisfactory theory 
of electrical conduction must be able to explain not only Ohm’s law but also 
such large differences in conductivity. There are additional effects to be ex- 
plained such as the effect of temperature, impurities, and alloying on the 
electrical conductivity of solids. 


TABLE 23-1 
Electrical Conductivity" 


Material ao (Q-m)~} 


Copper 6 x 10’ 
Aluminum 3 xX 10’ 
Iron 1 x 107 
Germanium 2 x 1072 
Silicon 4 x 1074 
Glass 2x 10™ 
Amber 2x 107 
Polystyrene 1 x 10°79 


“These values of o are at room tempera- 
ture. 


23.2 CLASSICAL FREE ELECTRON (CFE) MODEL 


The first successful attempt to understand the electrical properties of metals 
was presented by P. Drude in 1900 and was extended by H. A. Lorentz in 
1909. The results of these two investigators, as well as the work of others, is 
now called the classical free electron model (CFE). 


23.2a Assumptions of the Model 


1. The main assumption of the CFE model is that a metal is composed of an 
array of ions with valence electrons that are free to roam through the ionic 
array with the only restriction that they remain confined within the bound- 
aries of the solid. Because these valence electrons are responsible for the 
electrical conduction of the solid, they are called conduction electrons. 


2. The mutual repulsion between the negatively charged electrons is ne- 
glected. In addition, the potential energy due to the ions is assumed to be 
everywhere constant. These free electrons are basically treated as an ideal 


Paul Drude (1863-1906). 


H.A. Lorentz (1853-1929). 
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neutral gas that obeys classical Maxwell-Boltzmann statistics (see Supple- 
ment 9-1 at the end of Chapter 9). 


3. In the absence of an electric field, these electrons are moving with their 
random thermal velocities given by the ideal gas result (Eq. 9.21) 
1 — 3 
5 mv v2 = 5 Kal (23.6) 
where v2 is the average of the square of the thermal speeds; kz is Boltz- 
mann’s constant, 1.38 x 10~% J/K; and T is the absolute temperature of 
the solid and therefore of the electron gas. 


4. When an electric field @ is applied to the solid, the free electrons acquire 
an average drift velocity in the direction opposite to the electric field, thus 
producing an electric current. 


Despite the many simplifying assumptions, this CFE model was able to 
explain many properties of metals in a quantitative and satisfactory way. As 
we will show, it was able to predict Ohm’s law, as well as the Wiedemann- 
Franz law, an empirical formula relating the electrical and the thermal con- 
ductivities in metals, which will be discussed later. The CFE model was not 
totally adequate; subsequent models started with some of the basic assump- 
tions of the Drude-Lorentz model but modified them with new concepts. 
(Remember that quantum mechanics did not come onto the scene until 1926.) 


23.2b Ohm's Law: Derivation from the CFE Model 


Let us consider a piece of metal. According to the CFE model, there are a 
certain number of free electrons moving randomly with thermal velocities 
whose average value is obtained from Eq. 23.6. At room temperature, T = 
300 K, this is 


ORMS 


_ E ery 7 E x 1.38 x 10-3 J/K x 300 x] 
mM 


9.1 X 10-3 ke 


1.2 < 10° m/sec ~ 75 miles/sec 


where Ugms stands for root mean square (the square root of the mean of the 
square). Random motion means that, if you consider a hypothetical plane in 
the sample (Fig. 23-3), the rate at which electrons pass from the right to the 
left is the same as the rate at which they pass from the left to the right. 
Therefore, the net flow in any direction is zero, and there is accordingly no 
net electrical current. The way to obtain a net current is to bias the random 
motion of the electrons by applying an external force, that is, creating an 
electric field in the sample. This may be done by applying a potential difference 


FIGURE 23-3 
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across the sample. This biasing will have the effect of superimposing a net 
constant drift velocity on the random motion in the direction of the force. 
This concept was first introduced in Chapter 15 but will be repeated here as 
the starting point of further development of the theory. 

In Fig. 23-4 it is seen that, although the electrons bounce back and forth 
from collisions, there is a net drift velocity vz toward the right. One can 
visualize what is happening by thinking of a simple mechanical analogy. 
Consider a smooth, frictionless table with fixed pins uniformly distributed 
throughout. If you drop a large number of marbles on the table when it is 
flat, they will hit the pins and move randomly with no net motion in any 
given direction; that is, on the average as many marbles will hit one end of 
the table as the opposite one. However, if you tilt the table, then at the same 
time that the marbles bounce back and forth randomly from the pins, there 
is a net drift of all the marbles toward the lower end. Tilting the table effectively 
creates a potential difference (gravitational) between the higher and lower 
ends of the table. Why don’t the marbles acquire an acceleration in the di- 
rection of the force? Why only a constant drift velocity? The reason is the 
collisions with the pins. These collisions impede the downward motion of 
the marbles and therefore act as a retarding frictional force. Obviously, the 
more collisions per unit time, the greater the time rate of change of the forward 
momentum of the marbles—that is, the greater the retarding force. The drift 
velocity vz represents an equilibrium condition in which the rate at which the 
energy they receive from the external gravitational force equals the rate at 
which they lose it through collisions with the pins, which is the energy 
dissipated as heat. The situation is governed by Newton’s second law in 
which the vector sum of forces is zero, and therefore there can be no accel- 
eration between the top and bottom of the table although there are local 
accelerations between the pins. This analogy is close to the free electron model 
of the motion of electrons through a metal. An externally applied electric field 
replaces the gravitational field of the tilted table, and the metal ions replace 
the pins. The marbles are now electrons, and their motion resulting from the 
force of the electric field is impeded by successive collisions with ions. We 
will now derive Ohm’s law on the basis of this model. 

The definition of current is the rate at which an amount of charge Aq 
passes a given point, or 


food 


7. (23.7) 


In a time At the charges move a distance x = v, At. All charges that were to 
the left of plane A in Fig. 23-5 up to a distance x = v, At will pass by A in 
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FIGURE 23-4 
Random path of the electrons re- 


sulting from collisions with the 
ions (black lines) and the effect 
of an electric field on the path, 
which results in a net drift ve- 
locity v, in the direction opposite 
to the direction of the electric field 
(colored lines). (Source: David 
Halliday and Robert Resnick, 
Fundamentals of Physics, 2nd ed. 
Copyright © 1981 by John Wiley 
& Sons, Inc. Reprinted by per- 
mission of John Wiley & Sons, 
Inc.) 


FIGURE 23-5 
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At. If the number of charges per unit volume of the metal is N, then the 
number of charges that will flow to the right across A in time At will be N 
times the volume of a cylinder with cross-sectional area A and length x, or 


Number of charges = N A v, At 


If q is the charge of each electron, then the ainount of charge Aq that crosses 
A in time At is 


Aq = qNAv, At (23.8) 
Substitution of Eq. 23.8 in Eq. 23.7 obtains 
1 = qNAvg 


and by the definition of current density, Eq. 23.3, we have 
J = qNog (23.9) 


The dependence of J on the external electric field € is contained in v;. 

Consider the negative charge g shown in Fig. 23-6. The force acting on 
the charge resulting from the electric field @ is F = gé. From Newton’s second 
law 


F = ma = gé 
This may be written as 
dv 
ma = gé (23.10) 
and 
dv = ii dt 
m 
The velocity v at any time t is obtained by integration 
v £ 
i do = rf dt (23.11) 
0 m Jo 
and 
@ | 
Va = Tf (23.12) 
m 
where 
ve _, 
m 


by analogy to 
VU = Uo + at 


If t in Eq. 23.12 is taken as an average time 7 between collisions and vz from 


| = qNvq 


FIGURE 23-6 
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Eq. 23.9 is substituted into Eq. 23.12, then 


J _ 
gN om J 
or 
_ N¢r 
= (23.13) 


Comparing Eq. 23.13 with Eq. 23.5, we conclude that the conductivity o is 


q°N7 


= (23.14) 


og = 

The Drude-Lorentz theory predicts one of the key empirical facts about 
the electrical conductivity, Ohm’s law, and has given an explicit expression 
for electrical conductivity. To prove Ohm’s law fully, it must be shown that 
the expression for a, Eq. 23.14, is independent of @. This is equivalent to 
showing 7 independent of €, because q, N, and m by definition are indepen- 
dent of the electric field. In Fig. 23-7, | (mean free path) is taken as the average 
separation between ions. If v is the average velocity of the electrons, which 
is a combination of the thermal velocity vpms (determined by the temperature 
alone) and the drift velocity vz (which as we have seen depends on ©), we 
may write 


A simple calculation will show that vg << Upms in most cases; hence vz may 
be neglected in the calculation of 7 (see Example 23-1). Therefore the collision 
time 7 is given as 


(23.15) 


Example 23-1 


Consider a copper wire of cross-sectional area A = 1 mm¢* carrying a current 
i = 1 amp. What is the drift velocity vy of the electrons? Cu is monovalent, 
that is, there is one free electron per atom. The density and the molecular 
weight of Cu are 9 g/cm? and 64 g/mole, respectively. 


Solution From Eq. 23.9 
od 
a~ Ng 
The current density J is given by 
[= fe 10° amp/m? 


oN/ 


FIGURE 23-7 

In the CFE model the mean free 
path | (average distance between 
collisions of the electrons with the 
ions) is assumed to be the aver- 
age interionic separation. 
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Because copper is monovalent, the number of free electrons per unit volume 
N is equal to the number of atoms per unit volume Natoms. The latter can be 
found as follows: 


»Natoms = (number of moles/cm*) < (number of atoms/mole) 


The number of atoms per mole is given by Avogadro’s number Na = 
6.02 x 10*° atoms/mole. Thus 


3 
N= Nien = soon. x (6.02 x 10*° atoms/mole) 
64 g/mole 


8.4 x 1022 atoms/cm? = 8.4 x 107° atoms/m? 
Substituting for J and N into the expression for vg, we obtain 


10° amp/m* 


ee 
8.4 x 1078 atoms/m® x 1.6 x 10°" C Q~° m/sec 


Od 


As we saw earlier, at T = 300 K vems = 1.2 X 10° m/sec. Thus vg << vpgms 
and can be neglected as a contribution to v. Consequently, the relaxation time 
7 of Eq. 23.15 is determined by / and vgms, both of which are independent of 
6. Therefore, o is independent of ©. 

In addition to being able to explain Ohm’s law, the CFE was also able to 
explain the experimental fact that good electrical conductors are also good 
thermal conductors (see Supplement 23-1 at the end of this chapter). 


23.2c Failures of the CFE Model 


The CFE model is conceptually easy and succeeds in explaining certain im- 
portant experimental facts. It fails, however, to explain other experimental 
results. We will show later how many of the failures are corrected by the 
quantum mechanical model. The basic form of the conductivity equation will 
remain the same, the main difference being in the calculation of the time 
between collisions. What are some of the failures of the classical theory? 


1. Specific Heat 


As we indicated earlier, according to the CFE model, the valence electrons 
in a solid behave as an ideal neutral gas. In Chapter 9, Eq. 9.32, we showed 
that the molar specific heat of a gas at constant volume is 


E28 (9.32) 


where R is the universal gas constant. The contribution of the conduction 
electrons to the specific heat of a solid should therefore be given by Eq. 9.32. 
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Instead, experimentally the electronic specific heat per mole is 
C, = 107* RT (23.16) CC = 10° *arT 


Not only is the CFE model grossly incorrect in the actual value, but it predicts 
that the C, is temperature independent, whereas the experiment shows that 
C, 1s proportional to T. This represents an outstanding failure of the model. 
It is particularly puzzling in view of the success in explaining the electrical 
and thermal conductivities. In the CFE model it is impossible to comprehend 
how the valence electrons can participate in transport processes as if they are 
free and yet give such a small contribution to the specific heat, that is, be 
able to absorb heat. 


2. Temperature Dependence of o 


Experimentally, for a metal (except at very low temperatures) the electrical 
conductivity o is inversely proportional to the temperature T: o « T~!. Ac- ox T~! 
cording to the CFE model, Eq. 23.14, 


The only quantity that is temperature dependent is the time between collisions 
T = l/ugms. The average mean free path | will become smaller with increasing 
temperature because the ions vibrate with larger amplitude, thus increasing 
the chances of collision and therefore reducing the effective 1. The thermal 
velocity Ugms is also temperature dependent and is given by Eq. 23.6. 


1 3 
5 mMvaus = 5 Kel (23.6) 
Therefore 
(=) 1/2 
ORMS > | 
m 


which shows that vgms increases proportionally to the square root of the 
temperature. If we assume that the change of vgms with temperature is much 
larger than the change of J, we may write 

l 


— 73k,\ T7142 
ai 


Substituting this into the equation for a, Eq. 23.14, we find that the temper- 
ature dependence of the electrical conductivity should be proportional to the 
reciprocal of the square root of the temperature 


Thus, the CFE model predicts the incorrect temperature dependence of a, 
which experimentally is 0 « T~'. 
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23.3 QUANTUM-MECHANICAL 
FREE ELECTRON MODEL (QMFE) 


Many of the difficulties encountered by the classical free electron model were 
removed with the advent of quantum mechanics. In 1928, A. Sommerfeld 
modified the free electron model in two important ways: 


1. The electrons must be treated quantum mechanically. This will quantize the 
energy spectrum of the electron gas. 


2. The electrons must obey Pauli’s exclusion principle; that is, no two electrons 
can have the same set of quantum numbers. 


As a result of these modifications, when we put an electron gas in a 
solid, we begin by putting the electrons in the lowest energy states available, 
while obeying the exclusion principle, until we have used all the available 
electrons. This is to be contrasted with the classical free electron gas in which 
the electrons can assume continuous energy values, with many electrons 
having the same energy. This has profound implications for the statistical 
distribution of energies (the average number of electrons having a certain 
energy E) that the electrons can have. Thus, whereas a classical gas will obey 
Maxwell-Boltzmann statistics (Supplement 9-1, Chapter 9), the quantum- 
mechanical gas will follow a new type of statistical distribution known as the 
Fermi-Dirac distribution (Supplement 23-2 at the end of this chapter). This in 
turn will affect the way the electron gas can absorb energy from an external 
source, such as a heat source, and the way it responds to an electric field. 

Aside from these two key modifications, Sommerfeld kept most of the 
assumptions of the Drude model: 


1. The valence electrons are free to move through the solid. 


2. Aside from collisions with the ions, the electrostatic interaction between the 
electrons and the lattice ions is ignored. 


3. The interaction between the electrons is also neglected. 


Essentially, the valence electrons retain the main features of an ideal gas 
but a gas that must be treated quantum mechanically rather than classically. 


23.3a Three-Dimensional Infinite Potential Well 


If we want to treat a free electron gas quantum mechanically, the first question 
to ask is: What is the potential’ in which the electrons find themselves? We 
can start by noting the empirical fact that there are no electrons beyond the 
boundaries of the metal. There is some force keeping the electrons inside. 
This might be a large electrical potential barrier at the boundaries. And what 
about inside? How will the potential energy of an electron vary in the presence 


‘As indicated in Chapter 20 (Section 20.2), the potential energy is often referred to as the potential. 


Arnold Sommerfeld (1868-1951). 


QUANTUM-MECHANICAL FREE ELECTRON MODEL (QMFE) 


of the great number of ions and other electrons? Because we have decided 
to neglect interactions with ions and other electrons in this first approxima- 
tion, the potential energy inside will be uniform. These are, of course, sweeping 
assumptions, but the model works to a great extent. We will improve on it 
later. For now, let us consider the case of a uniform potential energy inside 
and an infinite potential energy at the boundaries of the solid so that no 
electrons can escape. 

We have previously solved in Chapter 20 the problem of a particle that 
finds itself in a potential well in which the potential energy E,(x) is given by 
the condition (see Fig. 23-8), 


eo for 0>x>a 
Ea) = {5 for OSxa 


The Schrédinger equation inside the well was given by Eq. 20.18. 


_ = “x = ey (20.18) 
or 
“x + ky = (20.19) 
where 
b= aa 


The solutions for y were vy = B sin kx, Eq. 20.23, inside and yx = 0 outside. 
When the boundary condition y = 0 at x = a was imposed, a restriction on 
the acceptable values of k and hence of E was found: Eqs. 20.24 and 20.26 


k, = ne where n = 1, 2,3,... (20.24) 
and, correspondingly, 
1h? 
Se 
En =H ra (20.26) 


The extension of the problem to three dimensions, that is, a particle moving 
in an arbitrary direction in a three-dimensional uniform potential well of 
infinitely high potential walls, is rather straightforward. Because each coor- 
dinate is at right angles to the others, their derivatives are independent. The 
Schrédinger equation in this case becomes 

2 2 2 2 
= {ex aX + St - By 


ax?” ay? az? 


met ay? (23.17) 


Although this is a more complex partial differential equation, the solution 
may be readily found by the separation of variables method that we have 
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ap oo 


FE plx) 


0 
r=U xy=a 
FIGURE 23-8 
One-dimensional infinite poten- 
tial well of width a. 
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seen previously (see Section 20.2d); that is, we assume a solution of the form 


x(x,Y,Z) = X1(X) X2{Y) X3(Z) (23.18) 


When we substitute Eq. 23.18 into Eq. 23.17, we obtain three ordinary dif- 
ferential equations for x1, X¥2, and x3, each identical to the one-dimensional 
case; that is, the solutions for y1, ¥2, and y3 will be of the form y = B sin kx 
(Eq. 20.23). Therefore, the solutions for y (x, y, z) will be 


x(x,y,z) = A (sin kyx)(sin kjy)(sin k3z) 
where 
A = B 
As before, when the boundary conditions y(x, y, z) = 0 forx = a, y = 4, 


and z = a are imposed, we obtain 


i eee 


ky 


T 
n,— where ny 
a 


kp = na — where N>p 1, 2, 3, -— 


2) 


kz, = ns — where n3 = 1, 2, 3,... 
And just as in the one-dimensional case, the energy of the electrons becomes 
quantized but with three quantum numbers specifying the state: 


ath? 


Enns = ama (nj + ns + n3) (23.19) 


23.3b Occupancy Conditions at T = 0K 


One important result from having three quantum numbers to specify the 
energy is that there may be degenerate states (states having the same energy 
but different quantum numbers). The lowest energy available will be for 
n, = 1, my = 1, and n; = 1, which we represent in simplified notation as the 
(1, 1, 1) state. The energy of this state is E = 3 Eo, where Ey = w*h7/2ma’. It 
is nondegenerate. The next higher energy level is (2, 1, 1) or (1, 2, 1) or (1, 
1, 2) for which E = 6 Ep. It is a threefold degenerate level. The next one is 
(2, 2, 1) or (2, 1, 2) or (1, 2, 2), E = 9 Eo, and so on. A summary of the first 
few levels is shown in Fig. 23-9. 

Let us now consider, for simplicity, an electron gas made up of 34 elec- 
trons. At absolute zero (T = 0 K) the gas will take the lowest energy available 
to it. 

Classically, because we know that the average energy of each electron is 
3/2 kpT, the average will be zero, and because the kinetic energy is always 
positive, this implies that each electron will have exactly zero energy. 


I di 2h? 
= - (ny + 16 + 5) 
sien 2ma- 
14-— (3, 2, 1) Sixfold degeneracy 
12-— (2, 2, 2) Nondegenerate 
11 (3, 1, 1) Threefold degeneracy 
Oo 
cy 
~ OF (2, 2, 1) Threefold degeneracy 
(eo) 
7 
a 
c 6 (2, 1, 1) Threefold degeneracy 
> 
om 
E 
ui 3e—- (1, 1, 1) Nondegenerate 
0 
FIGURE 23-9 


Schematic representation of the 
first few energy levels and the 
degree of degeneracy of each for 
an electron in a_ three-dimen- 
sional infinite potential well. 
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Quantum mechanically, the story is quite different. To begin with, E = 0 
is not an allowed energy state. The solution of the three-dimensional po- 
tential well gives the lowest energy state available as E = 3 Ey. We may be 
tempted to put all 34 electrons of our hypothetical gas in that state as the 
ground state. But this would violate Pauli’s exclusion principle (Section 21.7a), 
and therefore this arrangement is not possible. 

Let us see how we obtain the ground state for our gas of 34 electrons. 
In the state E = 3 Ey we can put two electrons: They will have the same set 
of quantum numbers, 11, Nz, 13, but their spins must be of opposite sign; one 
will have m, = 2 and the other m, = —3. In the state E = 6 Eo, we can put 
six electrons: two in the (2, 1, 1) state—these two electrons will have different 
spins; two in the (1, 2, 1) state with different spins; and finally two in the (1, 
1, 2) state—again these two electrons will have different spins. In E = 9 Eo, 
we can place six more electrons. In E = 11 Eo, we can place six more electrons. 
In E = 12 Eo, we can place only two because the energy state is nondegenerate. 
In E = 14 Ey, we can place 12 electrons, two in each of the six degenerate 
states. All 34 electrons have been placed in the lowest energy configuration. 
The ground state of the gas is one in which the electrons have energies that 
range between 3 Ey, and 14 Ey. We can state this fact as a probability statement: 
At absolute zero temperature the probability that a level below 14 Epo will be 
occupied is 1. The probability that a level above 14 Ep will be occupied is 0 
(all levels are empty). The highest filled energy level is called the Fermi energy 
at absolute zero temperature, E; (0). In our example Er (0) = 14 Ep. 

This probability statement is a particular case of the new type of energy 
distribution function that must be used to describe a quantum mechanical 
gas obeying the Pauli exclusion principle; it is known as the Fermi-Dirac dis- 
tribution function F(E) (Fig. 23-10). F(E) is the probability that a state with 
energy E is occupied by an electron. Atl = OK 


_ 0 for E > Er 
eo tt for E < Ep 


Let us attack a practical and important problem. We have considered, 
for pedagogical reasons, a gas made up of 34 electrons. How do you determine 
occupation levels when you have a gas of 10° electrons/m°, which is the 
problem that we face? The two important questions are: How do we determine 
Er for a large number of electrons? How do we find how many electrons we 
can place within a certain energy range AE? 


(23.20) 


23.3c Calculation of E-at7T = OK 


To determine the two quantities just mentioned without counting, we can 
use a standard trick of physics. Let us construct a lattice in n-space; that is, 
let us have a set of three mutually perpendicular axes. Let one of the axes 
correspond to n,, the others to m2 and nz (see Fig. 23-11). Let us plot a point 
for each integral value of 1, n2, and n3. In so doing we will get a uniform 
three-dimensional array of points. Each point in the array corresponds to a 
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FUE) 


FIGURE 23-10 
Fermi-Dirac distribution func- 
tion at T = OK. 


FIGURE 23-11 

Scheme to represent the allowed 
energy states of an electron in a 
three-dimensional infinite po- 
tential well. The axes correspond 
to the three quantum numbers 
that specify the energy of the 
electron, Eq. 23-19. A point is 
plotted for each integral value of 
Ny, Nz, and nz, that is, each point 
in this three-dimensional n-space 
corresponds to an allowed value 
of the energy. Here n, is the dis- 
tance of the occupied points far- 
thest away from the origin. 
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particular allowed value of the energy (corresponding to a particular set of 
quantum numbers nj, nz, n3). There are two facts that we must keep in mind. 


1. In a unit of volume of this n-space there is only one point. That is, we can 
fill the whole space with 1 x 1 x 1 cubes in such a way that there is one 
point at the center of each cube. 

2. As we have seen in our early example, although each point has a different 
set of quantum numbers 1, 12, n3, some of the points correspond to states 
with the same energy (they are degenerate). How do we identify them? 


Suppose that we write the energy, Eq. (23.19), as follows: 
E = E, (ni + ns + 3) = Ep rr (23.21) 


I 


where 
n= ni + ne 4+ n3 


We can identify the degenerate states by understanding the geometric sig- 
nificance of n, which is nothing else but the distance (in n-space) from the 
particular point to the origin; that is, the three-dimensional pythagorean for- 
mula with n,, nz, and n3 as the distances instead of the usual x, y, and z. 
Consequently, with a little thought we can conclude that points equidistant 
from the origin correspond to degenerate states. In addition, we see from Eq. 
23.21 that the farther away from the origin a given point is, the higher the 
energy of the state represented by that point. 

We can now proceed to answer the question: What is the Er for a gas of 
10*8 electrons/m*? Because E; (0) is the highest occupied energy level at T = 
0 K, we can write 


Er (0) = Eo n? (23.22) 


where n; is the distance of the occupied points farthest away from the origin. 
These points, being equidistant from the origin, lie on a spherical shell of 
radius n; (see Fig. 23-11). Because only positive values of n are permitted, 
only the positive octant of the sphere is used, which is one-eighth of a whole 
sphere. All points inside that shell are occupied by two electrons (correspond- 
ing to the two possible values of the spin). Therefore, we may write that 


Total number of electrons in the octant = 
(2) X (one-eighth the volume of a sphere of radius 1;) 
x (number of points per unit volume) 


But we can write that the total number of electrons is the number N of free 
electrons per unit volume of real space times the volume of the three-dimen- 
sional well, a°. Thus, 


N a? = (2) (5 x 571) (1) (23.23) 
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Solving for n; from Eq. 23.23, we obtain the value for n, 


3N a\'° 
n= (OT) 


and, substituting this value of n; and Eo = 7°h?/2ma? into Eq. 23.22, we obtain 
har? ea sd 


Er (0) = 


2ma’ aT 


We may algebraically simplify this equation to 
as h? 2\2/3 
Er (0) = om (3N 77) (23.24) 


This expression is the approximate value of the Fermi level in a solid at 
absolute zero temperature. All that is needed is N, the density of the free 
electrons. 

Example 23-2 


We showed in Example 23-1 that the number of free electrons in copper is 
8.4 x 1078 electrons/m>. (a) Calculate the Fermi energy for Cu. (b) At what 
temperature T; will the average thermal energy kgT; of a gas be equal to that 
energy? 


Solution 


(a) Substituting for the different parameters in Eq. 23.24, we obtain 


(1.05 x 10~* J-sec)* 
2x 9.1 x 10-* ke 


11.1 x 10°" J = 6.95 eV 


Er (0) = x (3 x 8.4 x 10% m=3 x 7)? 


I 


(b) We can equate this energy to kzI; to find the temperature at which 
the average thermal energy would be equal to the Fermi energy of 
Cu. 


This result illustrates the profound changes introduced by the QMFE 
model. For a classical electron gas to have thermal energies similar 
to those that a quantum mechanical electron gas has at T = 0K, the 
temperature of copper should be about 60 times higher than its melt- 
ing temperature, 1356 K. 


23.3d Density of States 


In our previous discussion, we have seen that the energies that an electron 
can take in a solid form a discrete quantized spectrum, the separation between 


375 


376 FREE ELECTRON THEORIES OF SOLIDS 


adjacent levels being (Eq. 23.19) 


If we take a macroscopic sample, for example, a = 1 cm, we may calculate 
the approximate energy difference between levels 


(1.05 x 10~* J-sec)* a? 
2 x 9.1 x 107°! kg x (107* m)” 


4x 10-8 eV (23.25) 


We have seen in Example 23-2 that the Fermi energy for copper is 6.95 eV. 
For most metals, Er is of the same order of magnitude. From Eq. 23.25, 
we conclude that the electrons in a solid occupy very closely spaced levels 
starting from very close to E = 0 eV all the way up to several electron volts. 
Thus, although the energy spectrum is quantized, the energy separation be- 
tween adjacent levels is so small that we often use the term quasicontinuous 
to describe the electronic energy spectrum of a solid. As a result, even in a 
microscopically small energy interval dE there are many discrete energy states. 
We can therefore introduce the concept of density of energy states, which will 
simplify calculations considerably. Let g(E) dE be the number of energy states 
available with energy between E and E + dE. Going back to our lattice in 
n-space, we consider an octant of a spherical shell of radius n and one of 
radius n + dn (see Fig. 23-12). All the states between these two octants have 
energies that range between E = n? Eg and E + dE = (n + dn)? Eo, Eq. 23.21. 
We choose dn so that dE is small, but because of the quasicontinuous nature 
of the spectrum, there are a large number of points within the octant shell. 
The density of energy states may be calculated in the following way. 


g(E) dE = (2) (volume between the octant shells) 
3 x (number of points/unit volume) 


AE = = 0.6 x 10-3] 


where 2 is for the two allowed spin orientations. 


9(E) dE 


(2) (5 x 4qn? in) (1) 


= qn’ dn (23.26) 


We can express g(E) in terms of the energy E instead of the state number n 
by making use of Eq. 23.21, E = n? Eo, or 


E 1/2 
"= (F) 
Differentiating n with respect to E gives 


E712 


~ EW? 


dn 


pia 
oo 
n+ an 


FIGURE 23-12 

Two octants in the n-space lattice 
of Fig. 23-11 of radius n and n + 
dn, respectively. All points be- 
tween these two octants have 
energies between E = n’E, and 
E + dE = (n + dn)Eo. 


QUANTUM-MECHANICAL FREE ELECTRON MODEL (QMFE) 377 


Substituting for n and dn into Eq. 23.26, we obtain Bib) 
g(E) dE = ee Ede 
or 
g(E) dE = CE’? dE (23.27) 
where E 
aT FIGURE 23-13 
C= E32 Density of states—that is, the 
number of energy states avail- 
Using the result from Eq. 23.19 that able to the electrons in a solid— 
as a function of the energy E. 
h2 a7 
° ma? 
the constant C has the value 
_ a3(2m)3? 
ea? (23.28) 


A plot of g(E) versus E is shown in Fig. 23-13. It is seen that the number of 

energy states 9(E) in an energy range dE increases with increasing energy. 

Although 9(E) gives the number of energy states available as a function of E, g(E) dE = CE dE 
it does not tell us the number of electrons occupying those states. To obtain 

this, we must multiply g(E) by the probability that a given energy state is 

occupied. This probability is given by the Fermi-Dirac distribution function, 

F(E) (see Eq. 23.20). If we define N(E) dE as the number of electrons with 

energy between E and E + dE, then 


N(E) dE = 9(E) F(E) dE | (23.29) 


Because at T = 0K, F(E) = 1 for E < E; and F(E) = 0 for E > Er, Eq. 23.20,  Ni# 
a plot of N(E) versus E will look like that shown in Fig. 23-14. 


Example 23-3 


Calculate the average energy per electron in a metal at T = 0 K. 


Solution We can use the standard method for the calculation of averages. 
That is, we take each value of E and multiply it by the number of electrons Er 
having that energy. We then add this product over all possible values of E 


and divide the result by the total number of electrons. In this case, the sum ee eda anak ane-clec 
becomes an integral because E varies quasicontinuously. ions 1a Sold At T= OK. that 
is, number of electrons having a 
E (T = 0) = — f° N(E) E dE certain energy E, as a function of 

Ntotal the energy E. 
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Substituting for N(E), Eq. 23.29, 


1 
N total 


doses EF 
EC =0)-= [ C EY B dE 


After substituting for C, Eq. 23.28, Er, Eq. 23.24, and Niotar = Na®, where N 
is the number of free electrons per unit volume, we obtain 


E(T = 0) = = Es (23.30) 


23.3e Energy Distribution of Electrons for T > 0 K 


So far we have confined our discussion concerning the energy distribution 
of the electrons to the case where T = 0 K. As we have seen, at T = 0K, 
the electrons, seeking the lowest energy states available to them, occupy all 
the states below E;, whereas the states above Er remain empty. 

What happens if the temperature is raised to some finite value of T? 
When an amount of thermal energy of about kgT is available to the electrons, 
they will try to move to higher energy states. But this is not possible for all 
the electrons. Because the energy available is of the order kgT, the electrons 
can jump to higher states lying up to kgT above the states they occupied at 
T = OK. For ordinary temperatures the thermal energy is much smaller than 
the Fermi energy (for example, at T = 300 K, kgI = 0.025 eV as compared 
with E; being a few electron volts). The result is that only electrons whose 
energy is within a range kgT below E; can make transitions to higher energy 
levels. There are lots of empty levels above Er. On the other hand, those 
electrons whose energy lies below E; by an amount much greater than kgT 
cannot jump to higher energy levels because these levels are occupied by 
other electrons and the Pauli exclusion principle prevents multiple occupancy. 
We can say that roughly only the fraction kgI/E; of the total number of 
electrons can be thermally excited to higher levels. Because kgT << E;, this 
represents a very small fraction. As a result, the probability of occupancy F(E) 
for temperatures below the melting point of the solid does not differ greatly 
from that at T = 0K. 

The exact mathematical expression (derived in Supplement 23.2 of this 
chapter) for the probability of occupancy of an energy state E reflects the 
qualitative argument presented here and is called the Fermi-Dirac distribution. 
The expression is 


FE) = - aya eee (23.31) 
exp [ = *) 1 
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Note that if T = 0 K and E < E;, Eq. 23.31 becomes equal to unity: 


F(E) = ——~———- = 1 


In contrast, if T = 0 K but E > E;, Eq. 23.31 is equal to zero: 


1 
F(E) = ———-—————. = 0 


AE 
exp (5) a tt 


We can also verify that if E < Er — kgI, then F(E) ~ 1. As an example, the 
probability of occupation for the state E = Er — 2kgT is 
; | 


F(E) = nove ee te 0.88 


which is not very different from the value 1 at T = 0K. Similarly, for E = 
Er + 2kgT, the probability of occupation is 
0.12 


a exp (2) + 1 7 


A sketch of the effect of temperature on the Fermi distribution is shown in 
Fig. 23-15. At temperatures above zero there is a symmetric shift of electrons 
with energies slightly below E, to states slightly above E,, as seen in Fig. 
23-15. 

The Fermi-Dirac distribution gives a new definition of the Fermi level: 
Er is the energy at which the probability that the state be occupied is 1/2. This can 
be seen by setting E = Er in Eq. 23.31. 


1 1 1 
F(E = Ep) = ——~——.——_ = = = 


2 ae & oa 
exp (FT) +1 ° 


For all T’s except extremely high T’s, Er is essentially the same as Er (0). 
An exact calculation of E;(T) yields: 


a kpT a 
ne) f "2 G a 


where the second term is small compared with unity. 


23.3f Failures of the CFE Model Revisited 
Let us now return to the two difficulties encountered by the CFE model that 
we discussed in Section 23.2c. 


FE) 


FIGURE 23-15 io 
Comparison of the Fermi-Dirac 


function at T = 0 K and at T > 
0 K. The probability of occupa- 
tion changes only for energies 
such that E — E-; ~ kpgT. At room 
temperature, kzT << E;, and the 
function is barely changed. 
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1. Specific Heat 


The classical free electron model failed to predict correctly the molar electronic 
specific heat C,. According to the CFE model (Section 23.2c) 


3 


==R 
2 


Cy 


whereas experimentally (Eq. 23.16) 
C, = 10~* RT (23.16) 


The heat required to increase AT the temperature of 1 mole of electrons is 


Heat = C, AT 
This heat input goes into increasing the internal energy of the electrons by 
AE 
AE = Heat = C, AT 
or 


AE 


ale (23.32) 


This expression for C, is correct only if C, is constant, that is, independent 
of T. If it is not (as we know to be the case here), the expression holds only 
when all the quantities involved in the definition are infinitesimally small. 

To obtain an expression for C,(T), we write the energy per mole as a 
function of T and differentiate with respect to T. This energy can be calculated 
exactly by using the density of states g(E) and the probability of occupancy 
F(E). Although this calculation can be done for T greater than 0 K, in practice 
it is a rather messy thing to do. We can get a good estimate of the value of 
E for thermally excited electrons by using a qualitative argument similar to 
the one we used before. At T = 0 K the average energy per electron is 
3/5 Er (0), Eq. 23.30. The energy of one mole of electrons, that is, N, electrons 
will be 3/5 N, Er (0). At a temperature T, only those electrons near Er can be 
excited. As we saw, for an electron with E = Er — 2ksT, the probability of 
occupation was 0.88 at T = 300 K, whereas it was 1 at T = 0 K. This means 
that the probability of being excited to higher levels is 0.12. For those electrons 
deeper down, the probability of excitation is essentially zero. As we indicated 
in Section 23.3e, we can assume as a rough estimate that only the fraction 
kgT/Er of the total number of electrons is able to gain thermal energy, and 
therefore, the number of electrons that can be excited is N, kgT/Er. The ad- 
ditional internal energy AE gained by the electron gas when T is increased 
from 0 to T is therefore 


AE = (number of excited electrons) x (average energy gained) 
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If we assume that the average energy gained per electron is kT, we may 
write 


kT 
AE = = Na) (kpT) 
Er 


Consequently, 


and from Eq. 23.32, in the case where AE and AT are infinitesimally small, 


_ dE 23 Na 7 
aT CCE 


C, 


Or, if we use the fact that ksN, = R, we obtain 


_ 2k: 


Cs E, 


RT (23.33) 


If we take Er at about 5 eV, a fairly typical value, then 


2kp 2 X 1.38 xX 10°F JK | oes 
Er 5eV X 1.6 x 107” J/eV 


and 
» = 1074 RT 


which is in agreement with the experimental results, Eq. 23.16. 

It is instructive to examine the details of the calculation of specific heat 
and try to understand why the quantum mechanical model succeeds whereas 
the classical one does not. In the classical model all the electrons are able to 
gain the thermal energy offered to them. The number is large and constant. In 
the quantum mechanical model, as a result of Pauli’s exclusion principle, only 
the fraction kgT/Er can gain energy. Because kgT << E; for the temperatures 
at which a solid remains solid, this is a small fraction of the total number of 
electrons and, moreover, the contribution to the specific heat by that fraction 
will depend linearly on T. 

The exact result for the electronic specific heat is 


kp RT = 4.93kp 


CAanais = 2 E, E; RT 


instead of 


Comal) = — RT 
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2. Electrical Conductivity 


The CFE model predicted that o x T~'*, whereas at most temperatures 
(except at very low T's) experiments show that 0 x T~*. We will now show 
that the quantized model yields the correct temperature dependence. 

When an electric field is applied to a solid, the electrons acquire a net 
drift velocity v; = g7é/m (where 7 is the collision time and € is the electric 
field), Eq. 23.12. How does this affect the velocity distribution of the quantum 
mechanical electron gas? The number of electrons with energies between E 
and E + dE is indicated in Fig. 23-16, which is a combination of Fig. 23-13 
and Fig. 23-15. The velocities of the electrons associated with these energies 
are completely random; that is, there is an equal chance of finding an electron 
with a given v, as with the same velocity in the opposite direction—the 
velocity distribution will be symmetric with respect to v, = 0. This distribution 
can be found from the definition of kinetic energy, which in this case is the 
total energy E, and the density of states (Eq. 23.27). Because E = 1/2 mv’, 
E¥2 = + Vm/2v and dE = mv dv. Substituting for E”? and for dE into Eq. 
23.27 we obtain 


g(v) dv x v* du 


The distribution for the x component of v is illustrated in Fig. 23-17. If an 
electric field is applied in the —x direction, giving rise to a force on the 
electrons in the +x direction, the entire distribution will be shifted toward 
positive velocities, this is shown in Fig. 23-18. The preceding statement implies 
that all the electrons participate in the conduction process: They all acquire 
some extra energy from the electric field. This is surprising in view of what 
we have said concerning specific heats. In that case, only those electrons near 
E, could be excited to higher energy levels. In the case of added thermal 
energy, the energy that the electrons acquire is randomly distributed among 
them and the velocity distribution is not shifted as a whole. Electrons in the 
+ side of Fig. 22-17 as well as in the — side want to go to higher energy 
states, that is, higher positive velocities if v is positive and more negative 
velocities if v is negative. But they cannot because those states are already 
occupied. Only those electrons near the Fermi level have unoccupied states 
nearby in energy. With the application of an electric field, instead of thermal 


FIGURE 23-16 


Comparison of the energy dis- 
tribution of the electrons in a solid 
at T = O K and at T > O K for 
the case where kgT << Er. 


FIGURE 23-17 ; 
x-component of the velocity dis- 


tribution of the electrons in a solid 
in the absence of an electric field. 


QUANTUM-MECHANICAL FREE ELECTRON MODEL (QMFE) 383 


energy, the energy from the field offered to the electrons is coherent and 
unidirectional: All move toward more positive velocities together. Thus, there is 
always a vacant state ready to receive an electron that is changing its state 
under the action of the electric field. The vacant state is created by the si- 
multaneous change of the velocity state of another electron. 

Although all the electrons participate in an electric conduction process, 
it is the electrons near E; that determine the scattering time 7 for all of them. 
The reason is the following. When the electrons are placed in the electric 
field, they increase their energy momentarily, then lose it on being scattered 
by the ions. However, this loss of energy is random: Not all the electrons 
lose energy at the same time, because / is a mean distance, and the deflection 
is not in the same direction. Because of this, not all the electrons can lose 
their energy on being scattered; only those that have empty lower states avail- 
able to them, that is, only those electrons near E;. Those deep down do not 
have empty lower states available to be scattered into. Do these electrons 
increase their energy indefinitely, as they do not seem to be able to lose it? 
The answer is no: The electrons ahead (higher in energy) prevent them from 
going to higher energy states. Therefore, they can only drift with the same 
velocity as the electrons near Er. Because the drift velocity is proportional to 7 
(see Eq. 23.12) the average scattering time is therefore 


T= — (23.34) 


where | is the mean free path and v;, is the velocity of the electrons at the 
Fermi level. If we use the expression for the electrical conductivity, Eq. 23.14, 


2N 
o = — (23.14) 


and substitute for 7 from Eq. 23.34, we obtain 
q°Nl 
MUFF 


(23.35) 


When an external electric field is 
applied to the electrons in the 
solid, the velocity distribution of 
Fig. 23-17 is shifted by an amount 
equal to the drift velocity v, in 
the direction opposite to that of 
the electric field. 
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The QMFE model now has a serious problem. As we have seen, Er and 
therefore vp are essentially temperature independent. It would seem that o 
is now also temperature independent unless / is a function of T. 

In the CFE model ! was basically determined by the position of the ions: 
In fact, we assumed that | was the average interionic separation. A hint that 
this assumption is not correct can be found immediately if we calculate 1, 
which is done in Example 23-4. 


Example 23-4 


The electrical conductivity of Cu at room temperature is 5.9 x 107 Q-! m7}, 
As we have shown earlier, the Fermi energy for copper is 6.95 eV (Example 
23-2) and the carrier density N = 8.4 x 1078 electrons/m? (Example 23-1). 
Calculate the mean free path of the electrons. 


Solution Write Eq. 23.34 as 
l= ort (23.36) 


The Fermi velocity can be obtained from the Fermi energy because Er = 
Yo mvs. 


4, a (zee)? _ [2.x 6.95 eV x 1.6 x 107” fev] 
Am - 9.1 x 107%! kg 


1.56 X 10° m/sec 


The scattering time 7 is obtained from the electrical conductivity. From Eq. 
23.14 


om 


—— ame 


= 7N 


— (6.9 x 107 O7* m") (9.1 x 1073! kg) 
~ (1.6 x 1079 Cj? (8.4 x 108 electrons/m?) 


= 2.50 x 10734 sec 


Substituting for 7 and for vp into Eq. 23.36 we obtain 


1 = 1.56 x 10° m/sec X 2.50 x 107“ sec = 3.90 X 10°28 m 


390 A 


This is roughly 100 times greater than the interatomic spacing. 
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The resolution of the problems faced by both the QMFE and CFE model, 
large / and the incorrect temperature dependence of the conductivity, is found 
by taking the wave nature of the electrons into account. The assumption that 
1 is the average interionic separation is fine if we consider a model of small 
marbles (electrons) colliding with large stationary objects (ions). We know 
from the de Broglie wave relation that the propagation of electrons and other 
elementary particles cannot be properly described by a particle model but 
rather is governed by a wave. We saw in Chapter 12 that a wave in a crystal 
lattice undergoes Bragg scattering, that is, it will be reflected (constructive 
interference) if the condition (see Fig. 23-19) 


2d sin 0 = na (12.11) 


is satisfied. If the wavelength is long compared with the crystal spacing, Bragg 
scattering cannot occur. The conduction electrons must be considered as waves 
that may undergo Bragg scattering by the lattice. In copper Er ~ 7 eV. This 
is the energy of the most energetic electrons. Because the de Broglie wave- 
length is A = h/p, these electrons will have the smallest wavelength, which 
is 


h 
Ar = 
PFE 
but 
_ be 
2m 
therefore 
h2 
oe 2mAz 
and 
h 
Ay = — 
* ~ (2mE,)¥2 


Poemrernrmrererir 00000 
~ (2 x 9.1 x 10 kg x 7eV x 1.6 x 10-® JieV)? 


Ar = 4.65 X 1079 m = 4.65A 


The spacing between planes of copper atoms is d = 2.09 A; therefore Ar > 
2d and from Eq. 12.11, sin 6 > 1. The Bragg condition for constructive reflection 
cannot be satisfied for any angle of incidence. This means that the electron 
waves in copper will not be reflected by the atomic planes but can traverse 


Atomic planes 


fil 
—_ 
ra 


(i -| 
| 


FIGURE 23-19 
Bragg reflection of electron waves 


by the atomic planes of the solid. 
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the crystal lattice without being scattered. As a consequence, the electrons 
would have a mean free path equal to the dimensions of the crystalline sample. 

The scattering of electron waves arises because of the deviations in the 
crystal from perfect periodicity: Such deviations include crystal lattice defects, 
impurities and, more important, the vibrations of the ions in the lattice. Except 
at very low temperatures, the main mechanism for the scattering in a pure 
crystal is the lattice vibrations. In the absence of such vibrations, the ions 
constitute a perfect lattice and, as we have seen, the mean free path of the 
electrons would be extremely large. We can represent this fact by visualizing 
the lattice ions as mathematical point particles with zero cross-sectional area, 
through which the electrons can move freely without being scattered. When 
the ions begin to vibrate, the lattice ceases to be ideal and now they offer an 
effective cross section for scattering. Clearly, the larger the cross-sectional 
area, the greater the chances of scattering and the shorter the mean free path. 
The mean path length of a moving object is inversely proportional to the 
cross-section of the target or 


| « — 
arr? 


where r, the radius of a circular target, is r? = x§ = y@ (see Fig. 23-20). 

We know from classical physics (Section 10.6) that the total energy of a 
vibrating object is E x (amplitude)’, that is, E « r?. But the energy of vibration 
comes as a result of the increase in temperature and is in fact proportional 
to the temperature, E « kzT. Therefore, 


r2 x T 
and therefore 
la JT} 
From Eq. 23.35, we see that 
2 
o =f = a To} 
MOUEF 


which is the observed temperature dependence of a. 

Although we have obtained this result by using a judicious mixture of 
simple wave and kinetic arguments, a more rigorous (and much more com- 
plicated) treatment based solely on the behavior of electron waves and on 
the theory of lattice vibrations leads to the same conclusion. 

The QMFE model has removed most of the problems that we encountered 
in the CFE model, but neither has answered one of our initial questions: 
Why do the values of o vary over such a wide range in different types of 
materials? 

This leads us into the topic of our next chapter: The Band Theory of 
Conduction. 


FIGURE 23-20 
Scattering cross-section pre- 
sented by a vibrating lattice ion 
to an incoming electron. 


SUPPLEMENT 23-1 
The Wiedemann-Franz Law 


The CFE model was able to predict Ohm’s law. It was also able to explain 
why good electrical conductors are also good thermal conductors (conductors 
of heat). Specifically, an empirical relation, known as the Wiedemann-Franz 
law, had been established in 1853 that relates the electrical and the thermal 
conductivities 

Se (23.37) 


where 7 is the thermal conductivity (to be defined following), o the electrical 
conductivity, T the absolute temperature, and £& is called the Lorentz number 
(a kind of universal constant, roughly the same for all materials), which has 
an average value of 2.3 x 1078 W-Q/K?. 

Let us examine the conduction of heat by electrons in a metal. Consider 
a thin slab, as in Fig. 23-21, of thickness Ax, cross-sectional area A, and let a 
temperature difference AT = Ty, — Tc exist across Ax. Let Ty and Tc represent 
the temperatures of the hot and cold sides, respectively. 

Experimentally one observes that heat energy is transferred from the hot 
to the cold side of the slab. Moreover, the heat current ipeat (heat per unit 
time) is found to be proportional to the temperature difference AT between 
the two sides and to the cross-sectional area A and inversely proportional to 
the thickness Ax of the slab. The proportionality can be converted to an 
equality by introducing a constant 7, which is called the thermal conductivity 
of the material. That is, 


theat = nA A ae 
As in the case of electrical conduction, we define a heat current density 


_ lheat 


J heat ~~ A 


which is the heat transferred per unit time per unit area. The thermal con- 
duction equation becomes 


AT 
Theat = 7) Ax (23.38) 


We can derive an expression for 7 based on the assumption that the 
agents that transfer the heat from one side to the other are the same that 
transfer the electric current when a potential difference exists across the sam- 
ple, namely, the conduction electrons. 

Consider, as shown in Fig. 23-22, a region filled with an electron gas 
with N electrons per unit volume. Let Ugms be the average thermal speed and 
let | be the mean free path between collisions. Let the thermal energy at the 
central plane of this region be E and the gradient of the thermal energy be 
— dE/dx from Ty; to Tc. The thermal energy of the electrons at a plane | away 
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Ty 


FIGURE 23-21 
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from the central plane (on the hot side) will be E + (dE/dx) 1 and the energy 
of those at a plane / away (on the cold side) will be E — (dE/dx) 1. The reason 
for choosing planes | away from the central plane is so that the electrons can 
reach and cross the central plane without colliding with the ions; therefore, 
as they cross the central plane they will have the same energy that they had 
when they were a distance | away from it. 

Let us evaluate how many electrons will cross the central plane in 1 sec. 
For the moment, let us assume that all the electrons to the left of the plane 
are moving toward it with a velocity Ugms. If the cross-sectional area of the 
plane is 1 m’ and 7 is the time to travel a distance / with an average speed 
of Vrms in the direction of the plane, then in time 7 all the electrons in the 
volume (1 m7’) (Upms 7) will cross the plane. In a time of 1 sec all the electrons 
in the volume (1 m7’) Upws (1 sec) will cross the plane, and if the electron 
density is N, then in 1 sec the number of electrons crossing the plane will be, 
by this model, Nvgms. But this model assumes that all the electrons are moving 
perpendicularly toward the plane, whereas on the average only one sixth of 
them have that motion. This is because the number of vector directions is 
+ x, + y, and + z for a total of 6. With this modification we may write 
that the electrons from the hot side will carry a thermal current density to- 
ward the plane from the left of 


1 dE 
+= = ——= 
J 6 Nvems (E + Ay ) 


The electrons from the cold side will carry a thermal current density to the 


plane from the right of 
dE 


1 
J> = g Norms (E = =) 
The net thermal current density will be the sum of the positive and negative 


currents 
Jneat ia J we ee J 7 


_ Nups | dE 


neat = 3 (23.39) 


The reason why there is an energy gradient dE/dx is that there is a tem- 
perature gradient dT/dx. We can relate these two by the standard mathematical 
chain rule, 

aE _ aE aT 
dx dT dx 

In Section 9.8 we introduced the molar specific heat at constant volume 
C,, which was defined as the heat energy needed to raise 1 mole of a substance 
by 1K. The heat energy needed to raise the temperature of 1 mole of electrons 
by dT is C, dT, and the increase in the energy dE per electron will be 


C 
= — dT 
uae | 


where N, is Avogadro’s number. 


Ty 0 Tc 


FIGURE 23-22 

Three hypothetical planes hav- 
ing different temperatures. Since 
the electrons at the plane labeled 
Ty have a higher thermal energy 
than those in the plane labeled 
Tc, there will be a net thermal 
current J* — J~ (thermal energy 
per unit time per unit area) across 
the central plane labeled 0. 


Substituting C,/N, for dE/dT in the chain rule, and then substituting the 
result in Eq. 23.39, we obtain the relation 
] _ Nogos ! C, dT 
heat — 3 Na dx 


Comparing this expression with the empirical one, Eq. 23.38, we see that 


_ Noegms C, _ Nvgms TC, 


an. a (23.40) 


In the last step we made use of the CFE model result, Eq. 23.15, | = vpms T. 
We can find the ratio of the thermal and electrical conductivities by dividing 
the expression that we have found for n, Eq. 23.40, by the expression for a 
that we found earlier, Eq. 23.14 


Nvégms T Cz 
1) 3Na 
o Nq? + 
Mm 


7) MUMS C 
ee 23. 
a 3gN, (23.41) 


The molar specific heat at constant volume is given by Eq. 9.32: 


3 


Substituting Eq. 9.32 for C, into Eq. 23.41, we obtain 


nN mvzgms R _ mvems kp 
— 29?N, = 2g (23.42) 


In the last step, we made use of the fact that R/N, = kg. Because 
1/2 mvgus = 3/2 kgT, (Eq. 9.21), Eq. 23.42 can be written as 


3 (ke\? 
- =5 (*) i (23.43) 


which is the Wiedemann-Franz law, Eq. 23.37. We obtain the theoretical value 
for the Lorentz constant by simple substitution 


3 (kp\? _ 3 x (1.38 x 107% J/K)? 
7 g 2 x (1.6 x 10-19 C) 


= 1.12 x 10°78 W-0/K? 


whereas the experimental value is £ = 2.3 x 107% W-O/K’. Although the 
derived value of £ is roughly one half the experimental value, the theoretical 
value is nevertheless of the same order of magnitude. This is quite good 
considering the many simplifying assumptions and omissions of the CFE 
model. 
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SUPPLEMENT 23-2 


Fermi-Dirac Statistics 


In Chapter 9 we saw that the Maxwell-Boltzmann distribution gives the rel- 
ative probability of occupancy of two states 7 and j with energies E; and E; by 
Eq. 9.40 as 


= = exp [— (E;) — E\/keT] (9.40) 


j 
Eq. (9.40) contains the implicit assumption that the presence of one or several 
particles in an energy state does not prevent other particles from occupying 
that state. In a crystal, because of the exclusion principle, only two electrons 
may occupy each state, and we need a different statistical probability function. 
Although these new statistics may be derived more rigorously by methods 
of statistical mechanics, we will give here an elementary demonstration that 
the statistical function we choose does satisfy the Pauli principle. We will 
ignore the fact that two electrons may occupy each state and assume that 
only one can. It can be done for two electrons, but the factor of two thus 
introduced will cancel at the end anyway. 

If two particles with energies E,; and E> collide, they may then have 
energies E; and E,, respectively, in a Maxwell-Boltzmann distribution. But in 
a distribution subject to the Pauli principle such a collision may take place 
only if energy states E; and E, are unoccupied. If F(E;) is the probability that 
energy state E; is occupied, then [1 — F(E;)] is the probability that energy 
state E; is vacant. In this collision the joint probability that both states E, and 
E, are occupied is the product of the two individual probabilities, F(E,) F(E2), 
and the probability that the final states are vacant is the product of their 
individual probabilities [1 — F(E3)][1 — F(E,)]. The joint probability that such 
a collision may occur is the product of all four individual probabilities, and 
by introducing a constant c that contains terms such as density of particles, 
cross-section, and velocity, we may write an expression for f, the number of 
this type of collision per second as 


f = cF(E,)F(E2) [1 — F(Es)][1 — F(E4)] (23.44) 
The rate of the reverse type of collision f’ may be written as 

f’ = cF(E3)F(E,) [1 — F(E,)][1 — F(E2)] (23.45) 
Under equilibrium conditions the forward and the reverse collision must be 


occurring at the same rate. We must therefore equate Eqs. 23.44 and 23.45. 
If we do so and divide by cF(E,)F(E2)F(E3)F(E4) we obtain 


1 — F(E3)] [1 — E(E)] _ [1 - F(E)] [1 - F(E,) 
F(Es) lI F(Es) |- FE) i FE.) (28.46) 


Enrico Fermi (1901-1954). 


\a ff 


Paul Adrien Dirac (1902-1984). 


Because energy is conserved in the collision process, if we write 6 as the gain 
in energy of the particle with initial energy E,, then —6 must be the loss 
in energy of the particle with initial energy E,. Therefore, we may write 
FE; = E, + 6 and E, = E, — 6. Substituting these expressions in Eq. 23.46 
we obtain 


1— FE, + 8)| [1 - FE. - 8] _ [1 - FE) | [1 - FE) 
F(E, + 6) || F(E, — 8) F(E;) || F(E>) (23.47) 


The relation of Eq. 23.47 can be satisfied with the substitution of 


1 — E(E; | 
Se ee C BE; 
F(E,) 
or 
F(E) = ——L-— 23.48 
OO Ty Cone ila 


where C and 8 are arbitrary positive constants. We may show that Eq. 23.48 
satisfies the relation of Eq. 23.47 by direct substitution: 


Cenee % OC eere Ol Cen Cer (23.49) 


Because the 6 terms cancel, Eq. 23.49 is an identity, and Eq. 23.48 is a solution 
of Eq. 23.47. 
Thus, the probability of the ith state being occupied is 


F(E) = where F(E;) < 1 (23.50) 


1 
Cee aa 
For large values of E;, C e8*' >> 1, so we may ignore the 1 in the denominator 
and write 


1 
FE) = ae (23.51) 


Equation 23.51 is the Maxwell-Boltzmann distribution, Eq. 9.39, with B = 
1/kgT. Because C is an arbitrary mathematical constant, let us write it as C = 
e~ Fr«sT and substitute this value and B = I/kgT in Eq. 23.50. We now have 


(23.52) 


Equation 23.52 is usually written without the 7 subscript and is called the 
Fermi-Dirac distribution and the constant E; is called the Fermi energy. It applies 
to condensed systems in which the Pauli exclusion principle is obeyed. 
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PROBLEMS 


23.1 Consider a copper wire of cross-sectional area 
A = 3 mm’. (a) What would the current through 
the wire be for the drift velocity to be comparable 
to the thermal velocity at room temperature, that is, 
vg ~ 10° m/sec? (b) Would Ohm’s law be obeyed in 
this case? Explain. 


23.2 Gold is monovalent. The atomic weight and 
the density of gold are 197 g/mole and 19.3 g/cm%, 
respectively. Calculate the number of free electrons 
per unit volume. 

(Answer: 5.87 x 1078 m~°.) 


23.3 The electrical conductivity of copper at room 
temperature is 5.9 x 107 Q~'m~?. (a) What is the 
thermal conductivity of copper at room tempera- 
ture? (b) What is the thermal conductivity at 1000 K? 

(Answer: (a) 407 W/m-K, (b) 407 W/m-K.) 


23.4 The electrical conductivity of copper at 300 K 
is 5.9 x 107 Q~'m~!. (a) What should be the con- 
ductivity at 77 K according to the CFE model? 
(b) How does it compare with the experimentally 
measured value of 4 x 108? Q~!m7™!? 

(Answer: (a) 1.2 X 108 Q~1m~?.) 


23.5 The electronic energy levels in the three- 
dimensional infinite potential well are given by Eq. 
23.19, E(ni, no, n3) = Eg (nz + ns + n3). Find the 
fractional difference in the energy between the pairs 
of states (a) E(1,1,1) and E(1,1,2), (b) E(10,10,10) and 
E(9,10,11) (c) E (100,100,100) and E (99,100,101). 
(d) What conclusion can you draw from these re- 
sults? 


23.6 Magnesium is a bivalent metal with an atomic 
weight of 24.32 g/mole and a density of 1.74 g/cm’. 
(a) What is the free electron density? (b) What is the 
Fermi energy? (c) Calculate the Fermi velocity. 
(d) What is the de Broglie wavelength of the elec- 
trons at the Fermi level? 

(Answer: (a) 8.4 x 1078 m~°, (b) 7.07 eV, 
(c) 1.58 x 10° m/sec, (d) 4.62 A.) 


23.7 The Fermi energy of cesium is 1.55 eV. 
(a) Determine the number of electrons in 1 cm* by 


free electron theory. (b) How does this compare with 
the number of cesium atoms in 1 cm*? (The density 
of cesium is 1.87 g/cm® and the atomic weight is 
133 g/mole). 


23.8 Ina Hall effect experiment like the one dis- 
cussed in Section 16-7, a rectangular gold slab 1 mm 
thick carries a current of 20 A. The magnetic field B 
= 1.2 T. It is found that the Hall voltage Vy = 
2.6 wV. (a) What is the free electron concentration? 
(b) How does the answer to (a) compare with that 
obtained by the method used in Problem 23.2? Gold 
is monovalent with molecular weight 197 g/mole and 
density 19.3 g/cm’. 
(Answer: (a) 5.77 X 1072 m7, 
(b) 5.87 xX 107° m~3.) 


23.9 When the sun stops producing energy by fu- 
sion it will collapse and become a white dwarf star 
with radius about that of the earth. The mass of the 
sun is 2 x 10° kg and the radius of the earth is 
6.37 x 10° km. Assume that there is one electron 
for every two nucleons. (A nucleon is the general 
name for the constituents of the nucleus, protons 
and neutrons). What will be the Fermi energy of the 
electrons in that white dwarf star? 

(Answer: 2.43 X 10° eV.) 


23.10 A neutron star is a giant nucleus composed 
of neutrons. Neutrons obey the Pauli exclusion prin- 
ciple and therefore have a Fermi-Dirac distribution 
of energies. Consider a neutron star of mass 4 xX 
10°° kg and radius 10 km. What is the average energy 
of the neutrons in that star? 


23.11 (a) Estimate how much energy would be re- 
leased by the conduction electrons in a 1 m? piece 
of copper if we could suddenly turn off the Pauli 
exclusion principle. (b) How long could a 100-W lamp 
be lit with this amount of energy? (The free electron 

density in copper is 8.4 x 107° electrons/m’.) 
(Answer: (a) 3.5 X 10°? eV, (b) 5.6 X 108 sec or 
17.6 years.) 


23.12 Let us assume that kinetic theory applies to 
a quantum-mechanical electron gas. (a) What would 


be the pressure of the electron gas at T = 0 K? (Hints: 
See Section 9.5, Eq. 9.18 and recall that the average 
energy of an electron obeying Fermi-Dirac statistics 
is 3/5 Er.) (b) Calculate the pressure for copper 
(Er = 6.95 eV, N = 8.4 x 107 m7”). 

(Answer: (a) 2/5 NEg, (b) 3.7 x 10'° N/m?.) 


23.13 Estimate the fraction of free electrons in cop- 
per that are in excited states at room temperature. 
The Fermi energy of copper is 6.95 eV. 


23.14 If the Fermi energy for a given metal is 1 eV 
and the metal melts at 2000°C, approximately what 
fraction of the electrons are in excited states at the 
melting point? 


23.15 The Fermi energy of silver is 5.1 eV. If the 
temperature is 300 K, what is the probability that a 
state be occupied for the following energies: (a) E = 
5 eV, (b) E = 5.2 eV, (c) E = 4 eV, and (d) E = 
6 eV? (e) At what temperature will the probability of 
occupation for the state of energy E = 5.2 eV be 0.1? 
(Answer: (a) 0.979, (b) 0.021, (c) 1, (d)9 x 107", 
(e) 527 K.) 


23.16 In the Fermi distribution let AE = E — E;. 
Calculate F(E) for AE = 2 kgT, 4 kgT, 10 kgT. 


23.17. Show that the probability that a state with 
energy E = E,; + AE be occupied is equal to the 
probability that a state with energy E = Erp — AE 
be empty. 


23.18 Let us define the Fermi temperature T; = 
E-/kp, where kz is the Boltzmann constant. What is 
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T; for silver? The Fermi energy of silver is 5.1 eV. 


23.19 The electronic molar specific heat of copper 
is C, = 1.65 x 10-4 T calories/mole-K. Find Er from 
that information. 

(Answer: 5.1 eV.) 


23.20 The electrical conductivity of aluminum, 
a trivalent metal, at room temperature is 3.8 x 
107 0-1! m~!. The atomic weight and the density of 
Al are 27.0 g/mole and 2.7 g/cm’, respectively. What 
is the mean free path for the electrons in Al at room 


temperature? 
7 (Answer: 151 A.) 


23.21 Acollection of N atoms have available to them 

two energy levels: the ground state E; and an excited 

state E,. Assume that the atoms obey the Maxwell- 

Boltzman distribution and that Ez — E,; = AE >> 

kgT. (a) What fraction of the atoms are in the excited 

state at a temperature T? (b) What is the average 

energy of the atoms at that temperature? (c) What 

is the total energy of the atoms? (d) Find the molar 
specific heat of the system. 

(Answer: (a) exp (— AE/ksT), 

(b) E, + AE exp (— AE/kzT), 

(c) N [E, + AE exp (— AE/ksT)], 

(d) R (AE/kgT)* exp (— AE/kzT).) 


23.22 Can photoelectrons be emitted thermally? If 
the work function of a metal surface is 1 eV, calculate 
the temperature the electron must have to escape in 
the dark. What fraction of electrons in a solid have 
enough energy to escape at room temperature as- 
suming that the electrons have the Maxwell-Boltz- 
mann distribution of energies? 
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24.1 INTRODUCTION 


The free electron models of metals that we have presented in the preceding 
chapter gives us a good deal of insight into several properties of metals. Yet 
there are many other important properties that these models do not explain. 
In particular, they do not tell us why, when chemical elements crystallize to 
become solids, some are good conductors, some are insulators, and yet others 
are semiconductors with electrical properties that vary greatly with temperature. 
These differences are not minor, but rather remarkable. The resistivity may 
vary from p ~ 10~® ohm-m for a good conductor to p ~ 10” ohm-m for a 
good insulator. 3 

We can understand the differences between insulators and conductors 
by extending the free electron model to take into account the interaction of 
the electrons with the positive ion lattice. In the quantum mechanical free 
electron (QMFE) model, we assumed that the potential energy inside the solid 
was uniform. It would be more realistic to assume that it is a periodic (alter- 
nating uniformly) function of x, y, z. This is reasonable because of the periodic 
distribution of the lattice ions in a crystalline solid. 

When the interaction between the electrons and the lattice ions is con- 
sidered, we will find some unusual properties possessed by the electrons in 
the crystal. 


1. In the last chapter we saw that the QMFE model gave rise to a series of 
discrete energy states, about 10~*° eV apart, which ranged from E ~ 0 to 
E = E;, the Fermi level. This range of energy levels can be called a band 
of energies, and the energy levels are so close together that they are referred 
to as quasicontinuous. In the band of the QMFE model there was an infinite 
number of unoccupied energy states above the Fermi level that could be 
occupied by excited electrons. When we introduce the potential of the 
lattice ions, we will see that bands of this type have upper and lower limits 
of allowable energies. If a band is not filled with electrons, then the elec- 
trons may be excited into the empty states and contribute to electrical or 
thermal conduction. If, however, a band is filled, then there are no states 
to be occupied and the electrons cannot be excited. For conductors, we 
will see that the behavior of the electrons in the occupied band of highest 
energy is almost identical to that predicted by the QMFE model, but 
not so for insulators and semiconductors. The band theory model will 
solve the questions concerning the differences between conductors and 
insulators. 


2. The electrons respond to an externally applied electric or magnetic field 
as if they were endowed with an effective mass m*, which may be greater 
or smaller than that of the free electron, or even be negative. By this we 
mean that the electrons can be treated as free in responding to an external 
electric and magnetic field provided that we assign to them a mass different 
from the true mass. 


3. There are situations in which instead of conduction by electrons it is con- 
venient to attribute the conduction to charge carriers with a positive charge 
+e, called holes. 


There are several methods (or models) to show the existence of bands 
and to find the shape of the band. Some work well for certain materials, some 
with others. Quantitative band calculations must take into account the par- 
ticular crystal structure, atomic configuration, and type of bonding: These 
details belong to the realm of solid-state physics research. Our purpose here 
is simply to show the existence of bands and the general characteristics. This 
can be achieved with idealized models and by using qualitative arguments. 


24.2 BLOCH’S THEOREM 


Before we proceed to study the motion of an electron in a periodic potential’, 
we should mention a general property of the wavefunctions in such a periodic 
potential. 

For a free electron with E, = constant, the space part of the wave function 
w (x, t), called the eigenfunction x(x), is written as (see Eq. 20.4) 


x(x) = e=m 


If the spacing of the ions in the x direction in a solid is d, then the potential 
energy of an electron at a point x distance from the origin is equal to the 
potential energy at a point x + d from the origin. This potential energy is 
equal in turn to that at point x + 2d from the origin, and so on. Therefore, 
we can generalize and take any point x in the lattice and state that the potential 
energy at that point is equal to the potential energy at point x + d or, stated 
mathematically, E,(x) = E,(x + d). This is known as a periodic potential. There 
is a theorem by Bloch which states that for a particle moving in a periodic 
potential, the eigenfunctions x(x) are of the form 


x(x) = u(x) e* ™* (24.1) 
where 
ux) = u(x + d) 


These eigenfunctions are plane waves modulated by a function u;,(x), where 
u,(x) has the same periodicity as the potential energy. Because the potential 
energy E,(x) = E,(x + d), one expects that the probability of finding a particle 
at a given x is the same as that of finding it at x + d. This is guaranteed by 


‘As indicated earlier, in Chapter 20, in quantum mechanics the potential energy is often called the 
potential. 
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the periodicity of u, and can be seen in the following expression for the 
probability density. 


ux(x) e~ ™ u,(x) e* 


X"(x) X(x) 


Ux(X) U(X) 
Therefore, when 
U(x) = u(x + a) 
then 
X"(x) x(x) = Xx + d+) x(x + A) 


The specific form of the function u;(x) will depend on the form of the function 
E,(x). We will now consider an idealized, one-dimensional periodic potential. 


24.3 THE KRONIG-PENNEY MODEL 


Let us try to understand what the potential energy of an electron in a crys- 
talline solid may look like. Consider a positively charged ion q and an electron e 
at a distance x from g as shown in Fig. 24-1. The electric potential energy 
from the coulomb attraction experienced by the electron is (Eq. 14.9). 


1 del 


Ep(x) = — Ame) x 


The variation of E, with x is illustrated in Fig. 24-1. Suppose we now place 
another charge q at a point d away from the first. The potential energy E, at 
any point on the x axis will be equal to the algebraic sum of the potential 
energies due to each individual charge, as illustrated in Fig. 24-2. The dashed 
lines represent the potential energy due to the individual q’s and the solid 
lines represent the sum of the dashed lines. If we now place a long array of 
g's separated by a distance d from each other to form a periodic array, the 


E, = 0 ee 


FIGURE 24-1 
Potential energy E, associated 
with the coulomb interaction of 
an electron with a positive ion 
+g as a function of the separa- 
tion x between the electron and 
the ion. 


} 
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potential energy E, looks like that shown in Fig. 24-3. The main features of 
the potential energy in Fig. 24-3 are: (1) it is periodic with a period d, (2) the 
maxima are halfway between the ions, and (3) the potential energy tends to 
~ © as the position of the ions is approached because the electron is bound 
more strongly to the ion as it comes closer and, because it takes more energy 
to pull it away, it can be said to lie in a deeper potential energy well the closer 
it is to the ion. 

If one tries to solve the Schrédinger equation for such a potential, one 
runs into mathematical difficulties that are best solved by a computer. How- 
ever, we can replace the potential energy of Fig. 24-3 with one that is math- 
ematically simpler to handle while retaining the essential features of the actual 
one. We replace the potential energy of Fig. 24-3 by one consisting of pe- 
riodically spaced rectangular wells as shown in Fig. 24-4. The potential energy 
is a series of rectangular wells of width c, spaced a distance b apart so that 
the periodicity d = b + c. The energy of the wells is — Exo. However, it is 
convenient to shift the zero of potential energy so that the bottoms of the 
wells are at potential energy E, = 0 and the tops are at E, = E,o. The potential 
energy of Fig. 24-4 has the same periodicity as the lattice; the potential energy 
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FIGURE 24-3 
Potential energy of an electron in a one-dimensional array of periodically spaced ions. 
The periodicity of the ions is d. 
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FIGURE 24-2 

Potential energy E, when an 
electron interacts with two ions 
(solid lines). The dashed lines 
correspond to the potential en- 
ergy associated with the inter- 
action of the electron with the 
individual ions. The solid lines 
represent the sum of the dashed 
lines. 
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FIGURE 24-4 


Periodic array of rectangular potential energy wells of depth E,,, width c, separated 
by a distance b. The periodicity of the wells is d = b + c. Such an array of potential 
energy wells can be used to approximate the potential energy of Fig. 24-3. 


is lower in the vicinity of the ions and highest between the ions. This potential 
energy model is known as the Kronig-Penney model. 

To find the behavior of the electrons in such a periodic potential, we 
have to find the eigenfunction that we can associate with them by solving 
the Schrodinger equation. We will not show all the mathematical details, but 
instead we will outline the method in order to understand the origin of the 
final result. 

Because E,, is either 0 or E,9, we solve for y separately in both regions I 
and II. We then impose the conditions of continuity for y and dy/dx discussed 
in Chapter 20, while meeting the periodicity requirements. We will consider 
the case where E < E,jo. 


Region | 
In region I E, = 0 and the Schrédinger equation is written as (Eq. 20.18) 
ay 
2m dx? 4 


where x; is the eigenfunction in region I. Rearranging terms, we get 


dy, 2mE 


ae + yx, = 0 where y = ~— (24.2) 


From Bloch’s theorem, y; = ;(x) e* (Eq. 24.1). If we substitute this y in Eq. 
24.2, we get a differential equation for uy, 


+ Qik + (y* — Bu = 0 


The solution of this equation can be found by the methods that we used 
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before in Chapter 20 and is 
u(x) = A ety ~ H* + Be ty + hx 


where A and B are arbitrary constants. 


Region Il 
In region II E, = E,9 and the Schrédinger equation is written as 
h> d* 
aa, re ai + E,oxXn = E xn 


where x is the eigenfunction in region II. Rearranging terms, we get 


2m(E,o — E) 
h2 


d Vi 

ao €*yn = 0 where é = | 
If we substitute yy = uy(x) e**, we will get a differential equation for uy that 
can be solved by the same method yielding for uy 


Uy = Ceé - ®* + De -€ + ivx 


where C and D are arbitrary constants. 


The next step is to impose the requirements of continuity and periodicity, 
between the regions I and II. It is seen in Fig. 24-4 that regions I and II join 
at x = c/2, therefore, following the discussion in Section 20.2e, we recall that 
both the eigenfunctions and their first derivatives must be continuous across 
a boundary; mathematically these criteria are 


axr (C\ _ axn (¢ 
dx \2 dx \2 


In addition, the periodicity requirements must be satisfied. This can be done 
by choosing points separated by the period of the lattice d, such as x = 
— c/2 and x = b + c/2. When we substitute Bloch functions for the y func- 
tions, that is, y = u(x) e***, the periodicity requirement on the function u(x) 
yields 


These four conditions on the eigenfunctions lead to four linear algebraic equa- 
tions for the constant A, B, C, D. In solving these equations, it is found that 
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a solution exists only if? 


pS cos yd = cos kd (24.3) 
yd 
where 
E 
oe and, from Eq. 24.2, y = = 


P= 


h? h? 


(Note: This P is a new term with no relation to momentum). If this condition 
is not satisfied, the boundary conditions on y cannot be satisfied, and the 
corresponding y’s are not acceptable solutions. In arriving at Eq. 24.3 we used 
one of the forms of Bloch functions, namely, y = u(x)e***. The same result 
will be obtained if we use the other form, that is, y(x) = u(x) e~™. 


24.3a Allowed and Forbidden Energy Bands 


The main result of the solution of the Schrédinger equation for the periodic 
potential of Fig. 24-4 is that the only acceptable y’s are those for which Eq. 
24.3 holds. 

Let us try to understand the significance of Eq. 24.3. Remember that 


2mE 
= K2 


Therefore y is a measure of the total energy, E, whereas k is the wave vector 
k = 2m/X. From de Broglie’s hypothesis, Eq. 19.1, A = h/p and therefore p = 
kh. Therefore k is a measure of the momentum of the particle. Equation 24.3 
relates the total energy E of the electron to its momentum p. (The correspond- 
ing expression for the free particle is E = p*/2m = k*h?/2m.) Finding a direct 
analytical expression E(k) is not possible because Eq. 24.3 is a transcendental 
equation that cannot be solved analytically. We can, however, solve it nu- 
merically: We choose a y (an E), insert it into the equation, and solve for k. 
This is not difficult; in fact, it is a simple exercise in computer programming. 
When we do this we will find a rather interesting result. There will be ranges 
of y for which k will be a real number. Those ranges of y will be separated 
by other ranges for which k is imaginary. The momentum of a particle cannot 
be imaginary, and the conclusion is as follows: A particle in this periodic 
potential cannot have values of y for which k is imaginary; therefore, the 
corresponding values for E for these y’s are not allowed. 

We do not have to go through the tedious task of selecting a y and 
substituting it into the Eq. 24.3 to show this. We can get the result by qual- 


“In arriving at Eq. 24.3, an additional simplification is made, namely E, is assumed to approach 
infinity and b to tend to zero while their product E,ob remains constant. 


P 


sin yd 


ya 


- + cos ya 


a 


cos kd 


sin (yd) 


itatively plotting the left side of Eq. 24.3, calling it f(yd). 
sin yd 
Y 


For example, let P = 5/2 7. The sine function is periodic, as in Fig. 24-5, and 
P/yd behaves as in Fig. 24-6. When we multiply these two functions to get 
the first term of f(yd), an oscillating function similar to sin yd results, but the 
amplitude will decrease with increasing yd. Some values are shown in Table 
24-1. 


f(yd) = P + cos yd 


TABLE 24-1 


Between 0 and z we have to be careful, particularly close to yd = 0, because 
when yd = 0, P sin yd/yd = P 0/0, which is undetermined. We can, however, 
use the L’Hopital rule on limits that 


d 
— sin x 
sin Xx _ ax cos x 
lim = lim = = ] 
x—0 Xx x—0 d x—0 
—x 
dx 


Thus, in the limit as yd — 0, sin yd/yd — 1. As yd begins to increase from 0, 
both sin yd and yd increase; however, the ratio decreases. If we express yd 
in radians, we can easily show with a calculator that sin yd/yd is a decreasing 
function of yd, which becomes 0 when yd = 7z. We can put all these facts 
together to obtain the plot of the first terms of f(yd); this is shown in Fig. 
24-7. Note that the larger P, the greater the slopes will be, because the zero 
positions are fixed. To get the entire function f(yd), we must add to Fig. 24-7 
the term cos yd, Fig. 24-8. Between 0 and 7 both functions decrease; therefore, 
f(yd) decreases and becomes — 1 when yd = 7. After 7, the first term continues 
decreasing, while the second begins to increase. Because cos yd changes 
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FIGURE 24-5 


ya 


FIGURE 24-6 
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sin ya 


slowly near the maximum and minimum, f(yd) continues to decrease below 
—1. Somewhere between z and 3/2 7 the trend reverses itself and f(yd) begins 
to increase, reaching the value of +1 at 27. After 27 the first term of f(yd) 
continues increasing while the second decreases. Again, just as before, the 
first term increases at a faster rate than the rate of decrease of the second 
term, and as a result f(yd) continues (for a while) to increase past + 1. Some- 
where between 27 and 5/2 7 the trend will reverse. 

These arguments are reflected in a plot of f(yd) versus yd, Fig. 24-9. The 
most important fact to note is that there are ranges of yd (shaded regions) for 
which the values of f(yd) vary between +1 and —1. These ranges of yd are 
separated by others for which f(yd) is either greater than +1 or less than —1. 
The width of the shaded region, the ranges of yd for which f(yd) varies between 
+1 and —1, increases as yd increases. The condition that had to be satisfied 
for the solutions to the Schrédinger equation to be acceptable was, Eq. 24.3, 


f(yd) = cos kd 


cos yd 


FIGURE 24-7 
Plot of the first term of the left 


vq Side of Eq. 24.3 as a function of 


yd. The plot, obtained by mul- 
tiplying the sine function of Fig. 
24-5 by the function of Fig. 24-6, 
is an oscillating function of yd 
with decreasing amplitude. 


FIGURE 24-8 


THE KRONIG-PENNEY MODEL 405 


flyd) 


FIGURE 24-9 

Plot of the function f(yd), that is, 
the left side of Eq. 24.3 versus yd 
(oscillating solid line). The shaded 
areas represent the values of yd 
for which f(yd) varies between 1 
and —1. These values of yd cor- 
respond to allowed energy val- 
ues. They are separated by ranges 
of yd for which f(yd) is either 
greater than 1 of less than —1. 
These values of yd correspond to 
forbidden energy values. 


Forbidden +'s Allowed +’s 


+] 


Because cos kd takes values ranging from +1 to —1, this means that this 
condition can be satisfied only by those values of yy for which f(yd) lies within 
those limits. The values of y for which f(yd) is outside these limits correspond 
to y’s for which the boundary conditions cannot be satisfied and, therefore, 
these y’s (and the corresponding E’s) are not physically acceptable. We con- 
clude that the electron may possess energies within certain bands of energy but not 
outside of them: There are allowed and forbidden bands of energy available to electrons 
moving in a periodic lattice. 

Another conclusion to be drawn from Fig. 24-9 is that the width of the 
allowed energy bands increases with increasing y (increasing energy E). The 
physical reason for this will become clear when we look at an alternative way 
of showing how the bands come about in Section 24.4. 


24.3b Dispersion Relation 


For a free particle the relation between the energy E and the momentum p is 
E = p?/2m. From de Broglie’s relation p = h/A (Eq. 19.1), and because A = 
2a/k (Eq. 11.12), it follows that 


Substitution of this relation for p into the energy relation shows that the energy 
of the particle can be expressed in terms of the wave vector as 


E=— (24.4) 


The relation between the energy of a particle and its wave vector is often 
referred to as the dispersion relation. For a free particle, this relation is parabolic; 
that is, E « k?, Eq. 24.4. This dependence is illustrated by the dashed lines 
of Fig. 24-10. 
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When the particle is not free, the dispersion relation is usually more 
complicated. Thus, as we have seen in Section 24.3a, for an electron moving 
in one-dimensional array of potential wells the dispersion relation is given 
by Eq. 24.3 


sin yd 


P + cos yd = cos kd 


mE 
Yo NG 


We saw that Eq. 24.3 is transcendental and must be solved numerically. We 
pick a value of E, substitute it in Eq. 24.3, and obtain the value of k for which 
the relation holds. If the procedure is repeated systematically for other values 
of E, we will be able to make a table listing values of E and the corresponding 
values of k. As we proceed with these numerical calculations, we find that 
there are energy intervals for which no real solution for k exists. These are 
the values of E for which the left side of Eq. 24.3 is either greater than +1 
or less than —1. As indicated in the previous section, this is physically un- 
acceptable and, therefore, these energy values are forbidden. Results from 
the numerical method just described are illustrated by the solid lines of Fig. 
24-10. We note that these solid lines yield the values of E and the correspond- 
ing values of k for certain ranges of E. For other energy intervals the value 
of kis not defined by the solid lines, these are the forbidden energies. Allowed 
and forbidden energies are projected to the right in Fig. 24-10 to represent 
the scheme of allowed and forbidden energy bands. Another important result 
to be noted in Fig. 24-10 is that the curvature of the solid lines is not the same 


(24.3) 


where 
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FIGURE 24-10 

The solid lines show the depen- 
dence of the energy E on the wave 
vector k (and therefore, the mo- 
mentum p) for an electron mov- 
ing in the periodic array of rec- 
tangular potential wells of Fig. 
24-4. The dashed line represents 
the relation between the energy 
and the mementum for the free 
electron case. The values of E for 
which Eq. 24.3 yields a real num- 
ber for k are projected to the right 
of the figure as allowed energy 
bands (shaded bands). These are 
separated by ranges of E for which 
Eq. 24.3 yields an imaginary so- 
lution for k and correspond to 
forbidden energy values. 
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as that of the dashed line, which represents the dispersion relation for the 
free particle. This has important implications concerning the effective mass 
of the electrons and will be discussed in detail in Section 24.6. 

Although the Kronig-Penney model presented here shows clearly the 
existence of allowed and forbidden energy bands and at the same time gives 
us a mathematical expression with which to find the E versus k curves, it 
does not give much physical insight for the existence of these bands. More- 
over, as presented here, it does not answer a question that will be important 
in the understanding of the difference between conductors, insulators, and 
semiconductors. The question is: How many energy states are allowed within a 
given band? The answer so far seems to be an infinite number because within 
a band y can vary continuously. It would seem, therefore, that E can take an 
infinite, continuous range of values within an allowed band. The reason for 
this result is that in the periodic potential model we have assumed that the 
periodicity is infinitely long. In a real solid we have boundaries and, although 
we may have 10” potential wells, it is still a finite number of wells. If one 
introduces the boundary condition that we used in the infinite potential well 
model of Chapter 23, namely, y = 0 at the boundaries of the solid, one finds 
that the continuous spectrum within a band breaks into a quasicontinuous 
one. Rather than doing this directly, let us look at an alternative method of 
showing the existence of bands. This method is less quantitative, less math- 
ematical, and less rigorous (although when used to do actual calculations it 
can be made rigorous), but it clearly shows the physical reason for the bands. 
This method will also yield the number of energy states allowed within a 
given band. 


24.4 TIGHT-BINDING APPROXIMATION 


One useful way to look at the formation of allowed and forbidden energy 
bands is to start with the energy levels of the individual neutral atoms when 
they are very far apart and watch the changes in these levels as the atoms 
get close together and the charge distributions of adjacent atoms begin to 
overlap. We can gain some insight into what happens by studying a simple 
one-dimensional quantum mechanical problem: that of two finite square po- 
tential wells. 

When we examined the one-dimensional infinite potential well of width 
a, Chapter 20, we found that the eigenfunctions and corresponding energy 
values that the particle may have are given by Eqs. 20.25 and 20.26. | 


xy, = Bsinn a (20.25) 
E,= WE, n= 1,2,3,... (20.26) 
where 
2a? 
Eo = a 
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In Fig. 24-11 we show the first two eigenfunctions for the infinite potential 
well (see Fig. 20-4). For the finite well the results are similar but with minor 
differences. One such difference is that the eigenfunctions do not vanish at 
the boundary but extend a little bit outside the well (see Fig. 24-12). 


Let us consider the two finite potential wells B and C shown in Fig.. 


24-13a. If we have an electron with energy E, that we know is definitely in 
well B, the y describing such an electron will look as in Fig. 24-13b; yg and 
therefore yg differ from 0 only in the region of the well B. Suppose instead 
that the electron is definitely in well C with energy E;. Then the eigenfunction 
will appear as shown in Fig. 24-13c; yc and therefore x2 differ from 0 only in 
the region of well C. But let us assume that the electron can be found with 
equal probability in both wells with energy E,. What eigenfunction do we 
use to describe the electron? To answer this question, let us put forward the 
properties that such a wavefunction must have. 


1. y must reflect the fact that the electron can be found with equal probability 
in both wells. This means that the probability y? must be symmetric with 
respect to a point halfway between the two wells. 


2. The part of y that reflects the probability that the particle be found in well 
B with energy E, must look like the eigenfunction associated with that 
energy when the particle is in well B; that is, it must look like Fig. 24-13b. 
And the same applies to the part of the eigenfunction that reflects the 
probability of finding the particle in well C. 


Before we answer the question of eigenfunction selection, let us consider 
the following. We said that yc is an eigenfunction that can represent an 
electron in well C with energy E,. Actually — yc could have been used instead. 
It has the same physical properties as yc, and it is a solution to the Schrédinger 
equation for the same value of the energy E,,° and, moreover, yé = (— yc). 
Therefore, — y must be considered equally with + y. 

Let us now answer the question: What eigenfunction do we use to de- 
scribe an electron with energy EF, that can be found equally in both wells? 

There are two possibilities: 


Ns = 4 (xB + Xc) 


NA a (XB — Xc) 


“Does —yc satisfy the Schrédinger equation for E,? Let us substitute it into the Schrédinger 
equation, 


h?. d? (—xc) 

— et Ee (~ Xe) = Fi (~ Xe) 
Multiplying both sides of this equation by —1, we get 
ee 

2m dx* 


+ E, X¥c = Ei Xe 


which demonstrates the same equality as does xc; so both yc and — yc are equivalent solutions 
to the Schrédinger equations for the same energy EF). 


FIGURE 24-11 

(a) Ground state eigenfunction y, 
and (b) first excited state eigen- 
function x2 for a particle in an 
infinite potential well. 


(b) 


FIGURE 24-12 
(a) Ground state eigenfunction y, 


and (b) first excited state eigen- 
function y2 for a particle in a fi- 
nite potential well. 
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FIGURE 24-13 
(a) Two finite potential wells B 
and C (b) Eigenfunction associ- 
ated with an electron that is def- 
initely in well B with the ground 
state energy. (c) Eigenfunction 
ve associated with an electron that 
/ \ is definitely in well C with the 


ground state energy. 


Kc = alXp + Xe) 


FIGURE 24-14 

(a) Symmetric eigenfunction rep- 
resenting an electron that can be 
found with equal probability in 
the two wells of Fig. 24-13 with 
the ground state energy. (b) 
Probability density associated 
with the symmetric eigenfunc- 
tion in (a). 


(@) 


FIGURE 24-15 
Antisymmetric eigenfunction 


representing an electron that can 
be found with equal probability 
in the two wells of Fig. 24-13 with 
the ground state energy. (b) 
Probability density associated 
with the antisymmetric eigen- 
function of (a). 


The constant a is introduced for normalization purposes; its value is not 
important to our arguments. It should be clear that y; (symmetric) shown in 
Fig. 24-14a satisfies the conditions set forth here. Figure 24-14b shows that 
x3 is symmetric with respect to the midpoint between the wells, and this 
reflects the fact that t » particle can be found with equal probability in the 
two wells. The other possibility, y4 (antisymmetric), Fig. 24-15a, also satisfies 
the conditions set forth, as can be seen by eompanins Fig. 24-15b for x4 with 
Fig. 24-14b for y2. 
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Xs = alXp + Xe) 


— ) i 

(a) + ——? = — if - 
FIGURE 24-16 " 

When the two wells are very close together, the symmetric eigenfunction of Fig. 24- 

14 looks like the ground state eigenfunction for a finite well of width 2a (see Fig. 24- 

12a). 


Let us see what happens to these two eigenfunctions when the separation 
between the two wells becomes very small. Figure 24-16a illustrates the in- 
dividual eigenfunctions yz and yc, and Fig. 24-16b is the sum xs = 4 (xB + Xc). 
The eigenfunction ys begins to look like the ground state eigenfunction for 
a well of width 2a. In fact, in the limit of no separation it becomes the 
ground state eigenfunction. On the other hand, the antisymmetric eigen- 
function appears as in Fig. 24-17. Figure 24-17a shows the individual eigen- 
functions yg and —yc, and Fig. 24-17b their sum, x4 = 4 (ys — Xc). The 
wavefunction y, begins to look like the wavefunction of the first excited state 
for a well of width 2a. 

We conclude that although when the two wells were far apart both xs 
and y, were degenerate eigenfunctions (same energy states), the degeneracy 
begins to disappear as the two wells get close to each other; x4 corresponds 
to a state of higher energy than ys. 

In our discussion we assumed that the electron can be in either well B 
or well C. Of course, when the two wells are far apart the question is purely 
academic. However, when they come close together so that the wavefunctions 
from the two wells overlap, it is possible for the electron that initially was in 
well C to move into well B, because its wavefunction is not zero at the location 
of well B, and the same holds for the electron that initially was in well B; it 


Ka = alXp — Xe) 


(b) 


FIGURE 24-17 
When the wells are very close together, the antisymmetric eigenfunction of Fig. 24- 
15 looks like the first excited state of a finite well of width 2a (see Fig. 24-120). 
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FIGURE 24-18 
Radial part of the ground state eigenfunctions of two isolated hydrogen atoms as a 
function of the distance of the electrons from the respective nuclei. 


can move into well C because its wavefunction has a finite value in well C. 
Thus, even if the electron was initially in one well, after the two wells are 
brought close together, if we wait long enough, there is no way to predict 
exactly in which one it will be. Similarly, if we had started with two wells 
and one electron in each well, after they are brought together, there is no 
way of saying in which well either of the two electrons will be. The only thing 
that we can say is that the probability of finding an electron in one well is 
the same as the probability of finding it in the other. Either y,4 or ys will be 
appropriate to describe its behavior because both v4 and yg are symmetric 
with respect to the midpoint between the wells. 

_ The important conclusion from this simple artificial example is that if you 
start with two identical y's (same energy) in two identical independent systems, when 
you bring the two together, the two degenerate x's break up into two nondegenerate 
x's. However, this example does not give the reason for this result. What is 
the physical reason for the breakup of the degeneracy? 

Let us look at a real example of two hydrogen atoms, each with its electron 
in the 1s ground state. If one solves the radial part of the Schrédinger equation 
for the hydrogen atom, Eq. 21.6, one obtains the radial part of the eigen- 
function for the 1s state as y = e ””. In the one-dimensional sketch, the 
eigenfunction decreases exponentially as the distance r from the nucleus in- 
creases. The eigenfunctions yz and yc in Fig. 24-18 are those associated with 
the two independent atoms B and C. As the two atoms are brought together, 
the eigenfunctions overlap, and the electrons from B and C can change places. 
We are led to consider, just as before, the two possible combinations of x; 
and yc shown in Fig. 24-19, where the dashed lines represent how the ei- 
genfunctions of each would appear if the other were not present, and the 
solid line is the sum or combined eigenfunction. The electron distribution 
between the two protons can be seen by plotting the probability functions 
lvs|? and |y,4|?, Fig. 24-20. Both distributions are symmetric with respect to the 
midpoint between the two protons, and therefore the probability of finding 
an electron at a certain distance from one proton is the same as the probability 
of finding it at the same distance from the other proton. Both eigenfunctions 
are thus suitable to represent the behavior of either electron. However, an 
electron in a state ys has a lower energy than one in y,. The reason is that 
an electron in ys is more likely to be between the two protons than being 
near to just one. Figure 24-20a shows that the probability function y2 in the 
region between the two protons is greater than on either side of the protons. 


Ke = alXp + Xe) 


(b) 

FIGURE 24-19 

(a) Symmetric combination of the 
ground state eigenfunctions of 
the two individual hydrogen at- 
oms of Fig. 24-18. (b) Antisym- 
metric combination of the same 
two eigenfunctions. 


Se 
(b) 

FIGURE 24-20 
(a) Probability density associated 
with the symmetric eigenfunc- 
tion of Fig. 24-19a. (b) Probability 
density associated with the an- 
tisymmetric eigenfunction of Fig. 
24-19b. Note that x2 is large in 
the region between the two nu- 
clei so therefore the electron rep- 
resented by ys spends consider- 
able time between both nuclei. 
x2 does not have this feature. 
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As a result, the electron spends a considerable amount of time between the 
two protons. In this region it is under the attractive influence of both protons 
at once. The binding energy of the electron resulting from the presence of 
the two protons will be more negative than if it was only under the influence 
of one of them. On the other hand, an electron in state y,4 spends its time 
with either one proton or the other. It is hardly ever with both (see Fig. 
24-20b) and, as a result, this extra contribution to the binding is not there or 
at least is very small. 

Thus, when two atoms are brought together, two separate energy levels are formed 
from each level of the isolated atom. The physical reason for this effect is the differing 
ways that the electrons interact with the ions in the symmetric and antisymmetric 
states. 

This is not a band, but two atoms do not make a solid. Suppose that N 
atoms are brought together to form a solid, then each of the levels of the 
individual isolated atoms breaks up into N discrete, closely spaced levels and 
becomes a band of energy levels. This can be illustrated with the 1s level of 
six hydrogen atoms. If we start with six individual 1s states and consider all 
the possible ways of adding the six individual 1s states, we get six types of 
combinations having different energies (see Fig. 24-21). In the first level, the 


6th level 
(Highest FE) 


lst level 


( L se hii lamas, 
t + d 5 6 


FIGURE 24-21 
Six possible combinations of the 


ground state eigenfunctions of 
six hydrogen atoms, each cor- 
responding to a different energy 
state. At the bottom of the figure 
the six individual eigenfunctions 
are sketched. The sketches above 
them represent six possible ways 
of adding them either symmet- 
rically or antisymmetrically. 
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six individual eigenfunctions all add symmetrically, 


Xfirst level = X1 + X2 + X3 + Xa + NS + Neo 


As a result, there are five places along the lattice where the electron will be 
under the attractive (binding) influence of two nuclei. In the second level, 
the y's from the first three atoms add symmetrically among themselves and 
so do the last three, but the resulting y from the first three adds antisym- 
metrically with the resulting y from the last three: 


Xsecond level (x1 a X2 + Xs) -_ (X4 + X5 + Xe) 


As a result, there are only four places along the lattice where the electron will 
be under the simultaneous influence of two nuclei that gives the extra negative 
contribution to the energy. The energy of an electron in this level will be 
higher than that of an electron in the first level. The combination of the 
individual y’s to form the rest of the levels is done in a similar way as the 
first two examples. The result concerning the energy of each can also be 
reasoned in a similar way: In the third level, there are only three places where 
the electron would be under the simultaneous influence of two nuclei and, 
consequently, the energy of the level will be higher than the previous two. 
This goes on until the sixth level, where this extra contribution to the binding 
is nowhere in the lattice. Therefore, this state corresponds to the highest 
energy. 

Are these the only possible combinations of the six x’s? The answer is 
no; but any other combination will be degenerate with one of the preceding 
as shown in the examples of Figs. 24-224 and b. The two examples shown in 
Fig. 24-22 are different combined ;’s, but they both have the same energy as 
the second level of Fig. 24-21. You may play the permutation game to achieve 
other combinations, but you will find that as far as the energy is concerned, 
all will fall within one of the six considered in Fig. 24-21. 

It should be apparent that when we extend this analysis to a lattice of N 
atoms, the individual identical states of the atoms will give rise to N different 
energy states. However, regardless of the number, the two extremes will look 
like the first and sixth level of our example. It should also be apparent that 
the energy difference between the two extremes and, therefore the width of 
the band, should not depend appreciably on N. Increasing N increases the 
number of sites where the extra contribution to the energy can take place. 
But it simultaneously decreases the amount of time that the electron spends 
in any one site, making the total time in all such sites constant. What affects 
the width of the band is how close any two atoms are to each other. The 


(a) (b) 

FIGURE 24-22 

Two alternative combinations of the six eigenfunctions in Fig. 24-21 that correspond 
to the same energy as the second level of Fig. 24-21. 
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interatomic separation 


closer, the greater the overlap of the wavefunctions and therefore the larger 
x? will be between the two nuclei for symmetric states, with the consequence 
of stronger binding. Figure 24-23 shows how the wavefunction for six different 
atoms separates into six different energy levels when the atoms are brought 
close enough together. And the closer they are brought, the greater is the 
separation of the energy levels. For the usual interatomic separation found 
in a solid, the width of the band is typically a few electron volts. For a 
macroscopic solid with N ~ 10”, the separation between adjacent levels will 
be ~ 10°* eV, an insignificant amount. 

We thus have shown that when one brings a large number N of atoms together 
to form a solid, the individual atomic energy levels of the atom break up into a 
quasicontinuous energy band. Within the band there are N distinct but very close 
energy levels. 

So far, we have limited ourselves to one-electron atoms where the electron 
is in the 1s state. When the analysis is extended to multielectron atoms where 
we have electrons in other states, one finds that each of the individual atomic 
states breaks up into similar bands of quasicontinuous states. Thus, if we 
consider sodium (Na) with an electronic configuration 1s? 2s? 2p® 3s* (Section 
21.7b), we may expect the band structure illustrated schematically in Fig. 
24-24. Notice that the bandwidth for the low-lying levels is smaller than for 
the higher energy ones. The reason is that electrons in the lower levels are 
electrons in the inner subshells of the atoms; these electrons are not influenced 
much by the presence of other atoms because their wavefunctions do not 


33 


2p 


2s 


3s band 


2p band 


2s band 


ls band 
Solid 


FIGURE 24-23 

Splitting of the 1s state of six hy- 
drogen atoms into a band of six 
energy levels as the separation 
between the atoms decreases. 
Note the increase in the band- 
width with decreasing intera- 
tomic separation. 


FIGURE 24-24 
Expected splitting of the first four 


atomic levels of sodium into four 
energy bands in a sodium crys- 
tal. 
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overlap significantly with those of the electrons of other atoms. Therefore, 
they give rise to narrower bands. 

The situation depicted in Fig. 24-24 is what one may expect in general. 
The situation for real solids is somewhat more complicated. We must realize 
that we have been using qualitative arguments and one-dimensional models 
for the sake of mathematical as well as conceptual simplicity. These models 
have educational value because they bring out the main features of band 
theory. However, if we want to get theoretical results that can be compared 
with those of the experiments, we must face the three-dimensional world, 
and our qualitative arguments must become quantitative. When this is done, 
we should not be surprised that the simple pictures may have to be modified 
somewhat. The main features that we have found—namely, that each atomic 
level breaks up into a band and that in each band there are N energy levels— 
are retained. These two features will enable us to understand the differences 
between conductors, insulators, and semiconductors. 


24.5 CONDUCTORS, INSULATORS, 
AND SEMICONDUCTORS 


We are now in a position to understand why some solids are good conductors, 
and some are not. We must keep in mind two facts. 


1. As we explained in Section 23.3f, for electrons to experience an acceleration 
in the presence of an electric field € and therefore to contribute to the 
current, they must be able to move into new, slightly higher energy states. 
This means that the states that are available for the electrons must be both 
empty and allowed. For example, if relatively few electrons reside in an 
otherwise empty band, a large number of unoccupied states are available 
into which the electrons can move; these electrons can acquire energy from 
the electric field and contribute to the current. On the other hand, if a 
band is full, then the electrons in that band cannot contribute to the current 
because they cannot move into slightly higher energy states. They therefore 
cannot be accelerated by the electric field. 


2. There is a limit to the number of electrons that can be placed in a given 
band. We know that there are N different energy states in each band. If 
the band is an s band (one formed from atomic s states) then, the orbital 
quantum number / = 0 and therefore m, = 0 and m, = +1/2. We can 
place two electrons in each of the N states without violating Pauli’s exclu- 
sion principle. In a p band / = 1, m, = 0, 1, —1 and for each value of m,, 
m, = +1/2. In each of the N energy levels we can put six electrons. In 
general, because for a given / there are (2/ + 1) values of m; and for each 
m, there are two values of m,, we have 2(21 + 1)N openings available to 
the electrons in a given band; for example, in a d band the number is 
2(2 x 2 + 1)N = I0N. 
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Let us consider some hypothetical examples on the basis of these two 
facts. Consider a solid of N atoms with each atom having 11 electrons. 
Altogether there are 11N electrons. 2N electrons may be put in the 1s band, 
2N in the 2s band, 6N in the 2p band. There remain N electrons that may be 
placed in the next available band, the 3s. But the 3s band has room for 2N 
electrons, and therefore it will be only half full (see Fig. 24-25). As a conse- 
quence, half the states in the 3s band (the unoccupied ones) are available to 
the N electrons in that band. Because there are available states, the N electrons 
in this band can be accelerated by an electric field and move into higher 
energy states. This solid would be an electrical conductor. Note that because the 
N electrons in the 3s band obey the Pauli exclusion principle, they obey Fermi- 
Dirac statistics; that is, at T = 0 K they all reside in the lowest energy levels 
of the band and the highest occupied level becomes the Fermi level. For T > 
0, a few (as we saw in the previous chapter) can be above the Fermi level. 
The N electrons in the 3s band behave as predicted by the QMFE model with 
one minor modification that we will discuss later. We can see that the effect 
of the periodic potential on the motion of electrons in a solid with a partially 
filled band is unimportant, and that is the reason why the QMFE model was 
so successful in predicting the properties of conductors. This hypothetical 
solid is of course Na (Z = 11). The only difference between the hypothetical 
solid and the real Na is that in sodium metal the next band, the 3p, overlaps 
the 3s band. But all that this does is to provide additional empty energy levels 
for the N electrons in the 3s band. 

The same arguments used for Na apply to lithium (1s* 2s’), potassium 
(1s? 2s? 2p® 35? 3p® 4s1), as well as to rubidium and cesium. (See the periodic 
table in Chapter 21.) 

It should be noted that the highest energy band containing electrons is 
called the valence band. If, as in the case of sodium, this band is only partially 
filled, it is also called the conduction band because electrons in that band are 
responsible for conduction processes. 

Next consider magnesium (Z = 12) with electronic configuration 1s* 2s? 
2p® 3s?. Ina solid with N atoms, there are 12N electrons. Following the previous 
scheme, 2N electrons go into the 1s band, 2N into the 2s, 6N into the 2p, and 
the remaining 2N into the 3s. All 12N electrons have been accounted for in 
the process. They have completely filled the 3s band. According to our pre- 
vious argument, there are no empty energy states available for the electrons 
in the 3s band to move into; therefore, they cannot contribute to conduction. 
With no empty states through which charged particles may contribute to 
conduction, Mg should be an insulator. But it is not. The reason is that as in 
the case of Na, the 3p and 3s bands overlap (see Fig. 24-26). Because the 3p 
band has 6N empty states, the 2N electrons have available to them 2N + 6N 
states of which only 2N are occupied. Mg is therefore a conductor. Similar 
arguments apply to beryllium, calcium, zinc and barium, all of which are in 
the same group in the periodic table. 

Now consider carbon (1s? 2s? 2p*) in its diamond structure. As N atoms 
of C are brought together, they have 6N electrons. 2N fill the 1s band, 2N fill 
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Schematic of the occupation of 
the bands by electrons in a so- 
dium crystal of N atoms and hav- 
ing, therefore, 11N electrons. The 
highest energy band with elec- 
trons (3s band) is only half full 
with N electrons, and thus so- 
dium is a monovalent metal. The 
3s band is the conduction band 
of sodium. 
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FIGURE 24-26 

Occupation of the energy bands 

in a magnesium crystal of N at- 

oms (12N electrons). The overlap 

between the 3s and the 3p bands 
make magnesium a metal. 
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the 2s band, and there are 2N electrons left to place in the next available band, 
the 2p, which has room for 6N electrons. The 2p band would be a partially 
filled band with plenty of empty states available (see Fig. 24-27). Diamond 
should therefore be a conductor. But it is not. It is an excellent insulator. As 
we mentioned before, the qualitative arguments elucidate the main features 
of band structure. However, when dealing with a specific crystalline material, 
the arguments must become quantitative. When this is done, interesting fea- 
tures occur, such as band overlap. In the case of diamond, germanium, and 
silicon, an even more interesting feature is revealed. 

As the carbon atoms are brought together to form diamond, the energy 
levels begin to split into bands starting with the outermost shell, n = 2 (2s 
and 2p levels) (Fig. 24-28). As the interatomic spacing decreases farther, the 
2s and 2p bands begin to overlap and merge into a single '2s 2p’ band with 
8N states available. As the separation decreases even farther, approaching 
the interatomic equilibrium spacing fo, the '2s 2p’ band splits again into two 
hybrid bands separated by an energy gap E,, which increases with decreasing 
separation. The value of E, is about 6 eV for the equilibrium distance of 7p ~ 
1.5 x 10~?° m. However, each of these two bands now contains 4N states. 
The result: Of the total 6N electrons, 2N go into the 1s band and the remaining 
AN into the lower hybrid '2s 2p’ band and fill it. Thus, at T = 0 K the valence 
band (the lower '2s 2p’ band) is full (Fig. 24-29), and diamond is an insulator. 
Note that this is only true for the diamond structure of carbon, not for graph- 
ite. The bands of germanium (Ge) and silicon (Si) show a similar behavior. 
In the case of Si, the mixing and subsequent splitting occurs between the 3s 
and 3p, whereas in the case of Ge it occurs between the 4s and the 4p. There 
is, however, an important quantitative difference between diamond and Si and 
Ge. The energy gap E, between the filled valence band and the next empty 
band for Ge and Si is much smaller than for C: E, (Ge) = 0.7 eV, E, (Si) = 
1.1 eV. At T = OK, pure C, Si, and Ge behave identically. They are perfect 
insulators because the valence band is filled. However, as T increases, some 
of the electrons in the valence band can be thermally excited across the energy 
gap into the next band, which now becomes the conduction band, and as a 
result electrical conduction can take place. How many electrons can be excited 


lara] 
ro 


Number of es 
n the Dana 


2D. wm | 2N 


[2 


ZN 


FIGURE 24-27 

Expected occupation of the en- 
ergy bands in a diamond crystal 
of N atoms. 


FIGURE 24-28 

The splitting of the atomic en- 
ergy levels of carbon into energy 
bands is followed by the merging 
of the 2s and 2p bands and a sub- 
sequent splitting of these bands 
as the interatomic spacing de- 
creases. At the equlibrium inter- 
atomic spacing fo, an energy gap 
E, separates two hybrid ‘2s 2p’ 
energy bands in a diamond crys- 
tal. 
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Hybrid 2s 2p 4N states, no electrons 


Hybrid 2s 2p 4N states, 4N electrons 


ls jo k ZN States, 2N electrons 


depends on how big E, is and, of course, on T. The higher T, the greater the 
thermal energy, and therefore the greater the number of electrons that will 
be able to make the jump across the energy gap and, naturally, the greater 
the electrical conductivity. This is the reason why the conductivity of an 
insulator and of a semiconductor increases with T (as opposed to a metallic 
conductor). We will discuss this further in the next chapter. 

The number of electrons in the conduction band at a given temperature 
will depend on E,, the magnitude of the energy gap between the valence and 
the conduction bands. The smaller E,, the greater the probability that the 
electrons at the top of the valence band will jump into the conduction band. 
Thus, at a given temperature, we expect that Si (E, ~ 1 eV) will have more 
electrons in the conduction band and therefore, be a better conductor than 
diamond (E, = 6 eV). In fact, as we will show in Chapter 25, the probability 
of transition across the energy gap is very sensitive to the magnitude of E,. 
A doubling of E, will reduce the number of conduction electrons by several 
orders of magnitude (powers of 10). It is the magnitude of E, that determines 
whether a solid is an insulator (diamond) or a semiconductor (Si, Ge) at 
ambient temperatures. Detailed calculations of the number of conduction 
electrons and its dependence on E, and T are presented in Chapter 25. 


24.6 EFFECTIVE MASS 


When an electric field € acts on a free electron, it exerts a force e@ that, from 
Newton’s law, will produce an acceleration inversely proportional to its mass, 
a = eé/m. What happens when the electron to be accelerated is not free but 
happens to be in a crystal under the influence of the potential of the lattice 
ions? The answer is that it will still accelerate according to Newton’s law; 
however, the electron responds as if it had some effective mass, which is 
different from its true mass. As we will show, this is because © is not the 
only electric field acting on the electron inside the crystal. 

We will introduce this concept by using a semiclassical picture: an ar- 
gument that is half classical and half quantum mechanical. The quantum 
mechanical part lies in the fact that the motion of an electron is governed by 
a wave, and that the velocity of the electron is equal to the group velocity 
Veroup Of the wave, that is, the velocity of the envelope, of the wave packet. 
In our treatment of matter waves and wave packets (Section 19.7), we saw 


FIGURE 24-29 

Actual occupation of the energy 
bands in a diamond crystal with 
N atoms (6N electrons). The lower 
hybrid ‘2s 2p’ band is full and 
separated by an energy gap E, 
from the higher hybrid ‘2s 2p’ 
band, which has no electrons. 


that the group velocity is given by Eq. 19.10: 
dE 

dp 

where E is the energy of the particle and p is its momentum. 


In the case of a free particle, we can readily show that the group velocity 

is equal to the particle velocity. For a free particle, the energy 
1 2 
E= 9 MUearticle = _ 


(19.10) 


Ugroup = 


Therefore 


Ugroup ~~ 


dE_ A (PY) _ PB _ MOpanice 
dp _ dp mM m m particle 


Although we have shown that Ugroup = Uparticle for the free particle case only, 
it can be shown that the relation holds even when the particle is not free, 
such as the case of an electron in a lattice. 

Equation 19.10 defines the group velocity in terms of the energy E and 
the momentum p of the particle. As we have seen in the Kronig-Penney model, 
the energy is often expressed in terms of the wave vector k. It is convenient, 
therefore, to define the group velocity in terms of E and k. This can be done 
by using de Broglie’s relation p = h/A (Eq. 19.1) and the fact that A = 2a/k 
(Eq. 11.12). Combining Eqs. 19.1 and 11.12 we have p = hk and hence dp = 
hdk. Substituting this result for dp in Eq. 19.10, we obtain 


_ dE 
Ugroup ~ h dk 
The classical part of the argument uses the definition from mechanics 


that if a force does work dW on a particle, the energy of that particle increases 
by the same amount: dE = dW. Applying this to the present case, we have 


(24.5) 


d 
dE = dW = é6 dx = &6 — dt = cv, di 
The rate at which the energy of the particle is changing is therefore 
dE 
Tt : €& Ug (24.6) 


We also know from Newton’s law that when a force acts on a particle, 
it will be accelerated. By definition, the acceleration a is 


a= AV particle = dv, 
dt dt 
Substituting Eq. 24.5 for v, we obtain 
_lddk 
hh dt dk 
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Interchanging the order of the differentiation of E, we may write 
a (24.7) 


Substituting Eq. 24.6 for dE/dt in Eq. 24.7 yields 
eé dv, 


he dk 


From Eq. 24.5, it follows that 


Rearranging terms, we get 


e&& =—— a (24.8) 


Noting that e@ is the force of the externally applied electric field, we conclude 
that Eq. 24.8 has the form F = m*a, where 


h,2 
m* = dE (24.9) mi” 
dk? 
The response of the electron in the solid to an externally applied electric field 
is as if it had an effective mass m* given by the expression in Eq. 24.9. Let 
us see if Eq. 24.9 gives the correct result for the free electron case. From Eq. 
24.4, for the free electron 


E=— (24.4) 


and 


Substitute this result into Eq. 24.9 and obtain 
h2 
cia 
m 


m = =m 


When the electron is free, the effective mass is the true mass, as it should 
be. However, when the electron is in a crystal, m* is different from m because 
the energy is not proportional to k*, as we saw in the Kronig-Penney model. 

The physical reason is the following. The electron in the crystal moves. 
under the influence of internal forces exerted by the electric fields of the ions 


of the lattice and the external force resulting from the externally applied electric 
field. If we choose to use Eq. 24.8 to describe the motion of the electrons, we 
describe the motion in terms of the external force alone. However, the effect of 
the internal forces is hidden in m”*. 

Let us recall the results that we obtained from the Kronig-Penney model 
concerning the relation between E and k (Fig. 24-10). The relation for the first 
allowed band is represented by the solid line in Fig. 24-30a. For small k’s both 
curves, the free electron one (the dashed line) and the one obtained from the 
model, are quite similar. Both the first and the second derivatives are almost 
the same; therefore, m is about equal to m*. Notice, however, that in the 
Kronig-Penney curve the first derivative (the slope) first increases for small 
values of k and then decreases as k approaches a/d. We illustrate a qualitative 
plot of the slope dE/dk versus k in Fig. 24-30b. We may also draw a qualitative 
illustration of d?E/dk? from Fig. 24.30b. Its appearance is that of Fig. 24-30c. 
The reciprocal of d?E/dk? is proportional to the effective mass, from Eq. 24.9. 
These reciprocal curves are shown in Fig. 24-30d. We see that for small values 
of k the effective mass m” is essentially equal to the mass of a free electron 
m. As dE/dk approaches the maximum, d*E/dk? begins to decrease and m”* 
increases. When dE/dk reaches the maximum, d?E/dk? = 0 and m* becomes 
infinite. Subsequently, d?E/dk* becomes negative and m* is negative. 

We may draw the following conclusions about the effective mass m* of 
an electron moving in a periodic lattice (see Fig. 24-30d). 


1. m* is not always equal to m. 
2. m* can be greater than m and, in fact, infinite. 
3. m* can be less than m or even negative. 


We can gain some understanding of the behavior of m* if we consider 
the way an electron wave is reflected by the lattice ions. We know from Section 
12.7 that a wave is totally Bragg reflected if 


2d sin 0 = nA (12.11) 


In the case of a one-dimensional lattice that we have been considering, 
6 = 90° (Fig. 23-21) and Eq. 12.11 becomes 


2d = nar 


FIGURE 24-30 

(a) Dependence of the energy E of an electron on its wave vector k, and hence its 
momentum p (solid line) for the first allowed energy band of the Kronig-Penney model 
(see Fig. 24-10). The dashed line illustrates the relation between E and k for the free 
electron case. (b) Qualitative plot of the derivative of E with respect to k as a function 
of k for the solid line of (a). (c) Qualitative plot of the second derivative of E with 
respect to k as a function of k. (d) Dependence of the effective mass m* of the electron 
on the wave vector k for the situation depicted by the solid line of Figure (a). Note 
that the effective mass is negative near the top of the band, that is, as k approaches 
aid. 
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This is sketched in Fig. 24-31. We can express the Bragg equation in terms of 
the wave vector k = 27/X 


and 
1 
k=n F 

At the bottom of a band where k ~ 0, there is practically no reflection because 
the Bragg condition is far from being satisfied. The lattice ions will have little 
effect on the electron when it is accelerated by the external field €. Higher 
up in the band, k will get closer to the critical value w/d and reflection starts 
to become appreciable. In this region, as the external field © accelerates the 
electron, the momentum increases and gets closer to the critical value. The 
external field increases the forward momentum, but at the same time enhances 
reflection, and the reflection corresponds to reversing the sign of the mo- 
mentum. At the point where 1/m* = 0 (that is, m* = ©), the gain in the 
forward momentum resulting from the applied € is exactly compensated by 
the resulting enhancement in reflection by the lattice ions. The net change in 
the forward momentum is zero. Thus the overall response of the electron to 
the field € is as if it had an infinite mass; that is, it cannot be accelerated. At 
the top of the band, even closer to the critical value for total reflection, the 
second effect (enhanced reflection) is more important than the direct action 
of the applied field. The net result is that the electron responds with a change 
in momentum that is in the opposite direction to what the free electron would 
have acquired: the electron responds as if it had a negative mass. 

The concept of effective mass has many uses. For example, in the free 
electron models we derived an expression for the electrical conductivity o by 
considering the response of the electrons to an electric field, and we found 
the conductivity o@ « 1/m (Eq. 23.14). We can easily make the correction to 
the model, that is, account for the fact that the electrons are not really free 
but instead they move in the electric potential of the ions. All that has to be 
done is to replace the true mass m by the effective mass m*. In many metals 
this has little effect because m*.~ m (for example, Cu, Na, Al, and K). But in 
some metals it has a significant effect. As an example, the average m* for iron 
(Fe) is about 10 times the free electron mass m, and this is one reason why 
iron is not a very good electrical conductor. 

What about negative mass? Are there any cases where the electrons travel 
in the same direction as the electric field? These questions lead us to our next 
topic. 


24.7 HOLES 
The concept that electrons near the top of the band have negative effective 


mass leads to a very interesting feature that has a tremendous importance in 
the operation of all semiconductor devices. 


At T = OK, the band structure of a semiconductor is characterized by a 


FIGURE 24-31 

Bragg scattering of an electron 
wave by a one-dimensional array 
of ions. 
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FIGURE 24-32 
(a) At T = OK, the valence band of a semiconductor is filled with electrons and 


separated by an energy gap E, from an empty conduction band. (b) At T > 0 K electrons 
are thermally excited into the conduction band, leaving behind in the valence band 
unoccupied energy states called holes, which behave as mobile positive charge carriers. 


fully occupied valence band and a completely empty conduction band (see 
Fig. 24-32a). The semiconductor ideally is an insulator with zero conductivity 
at T = OK. As the temperature is raised, some electrons in the valence band 
can receive enough thermal energy and be excited into the conduction band 
because the energy gap between the two bands is rather narrow. The result 
is that there are some electrons in an otherwise empty conduction band and 
some unoccupied states in an otherwise filled valence band, see Fig. 24-32b. 
An empty state in the valence band is called a hole. 

The electrons in the conduction band can move under the influence of 
an external electric field because they have available to them many empty 
higher energy states, and they can contribute to the current density J. Simi- 
larly, the electrons in the valence band can move into the empty states (holes) 
left by the electrons that were excited into the conduction band. We will 
assume, based on our previous discussion, that the empty states at the top 
of the valence band are negative effective mass states. 

The interesting and important feature that we mentioned before is that 
the conduction by the electrons in the valence band as they move into the empty 
negative mass states is completely equivalent to the conduction by particles of positive 
charge and positive mass. The number of such +g, +m particles is equal to the 
number of available empty states, that is, the number of “‘holes.”’ Basically, what 
we are saying is that when considering the contribution to the electric current 
from the valence band, we ignore the electrons, and instead we treat it as if 
conduction took place via positively charged holes. 

There are many analogies to explain the phenomenon of electrical con- 
duction by holes, although one must be careful not to carry them too far. We 
present now one such analogy given by Rudden and Wilson.* 

Consider the seats in a theater, and let us assume that the orchestra 
section is separated by a gap from the mezzanine. Moreover, let us assume 
that the orchestra is completely empty and that each seat in the mezzanine 


4M. N. Rudden and J. Wilson, Elements of Solid State Physics (New York: John Wiley & Sons, 
1980). | 
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is occupied. No motion of people from one seat to another can take place as 
long as everybody stays in the mezzanine. However, suppose that one person 
moves to the orchestra section, and thus leaves an empty seat behind. This 
seat can then be occupied by the former neighbor, who having moved, leaves 
behind another empty seat (see Fig. 24-33). A third person moves now to 
occupy this seat and so on. The motion could be studied from two viewpoints: 
the individual motion to the right of each person could be considered (a many- 
body problem) or the motion of the vacant seat to the left could be investigated 
(a single-body problem). Clearly, the latter is more desirable. 

When we have a completely filled band, its contribution to the current 
in the presence of an electric field is 0, or 


J full = 0 


Let us now remove electron i from the top of the valence band (remember 
that it has a negative effective mass); the contribution to the current density 
] from the remaining electrons will be 


J remaining ~~ Jean 7 J i 


where J; is the contribution from the 7 electron. But Jun = 0, and therefore 
J einaisine = J i 
But J; = —|elv,, where v; is the drift velocity that the i electron would acquire 


from © and |e| is the magnitude of the charge of the electron. We know from 
our earlier discussion of conduction (see Eq. 23.12) that v; = a;t;, where a; is 
the acceleration of the i electron and 7; its relaxation time (time between 
collisions). Therefore 


jae a le| A;T; (24.10) 


Thus, the contribution to the electric current density resulting from the elec- 
trons that remain after electron i is removed is equivalent to that of a single 
charged particle. If we had removed two electrons, the current density would 
be equivalent to that of two charged particles. We cannot say positive charged 
particles yet because a; could be negative (that is, in the direction opposite to 
€). From Newton’s law, the acceleration of the 7 electron is 

_ alee 


+ 
mM; 


a; 


FIGURE 24-33 

Theater analogy of the behavior 
of electrons in the conduction 
band and holes in the valence 
band of a semiconductor. (From 
M.N. Rudden and J. Wilson, Ele- 
ments of Solid State Physics. Copy- 
right © 1980 by John Wiley & 
Sons, Inc. Reprinted by permis- 
sion of John Wiley & Sons, Inc.) 


But m? is negative because the i electron is from the top of the valence band, 
and therefore 


_ lelé 


=a er (24.11) 
|m?| 


i 
The acceleration a; is in the same direction as the electric field @. We can 
therefore say that the contribution to the current from the electrons that 
remain in the valence band when a certain number have been removed by 
thermal excitation or some other process is equivalent to that of the same 
number of positively charged holes with positive mass. The expressions in 
Eqs. 24.10 and 24.11 are what one would expect for a real positively charged 
particle with positive mass, as illustrated schematically in Fig. 24-34. There- 
fore, when calculating the current density J resulting from the valence elec- 
trons, we ignore them and instead consider only the few positive holes. The 
total current will be the sum of the contribution of the holes moving in the 
direction of the field and the electrons opposite to the field direction. A 
measurement of current with an ammeter cannot distinguish between the 
two charge carriers. 

The reality of holes can be experimentally verified by means of the Hall 
effect. As we showed in Chapter 16, Section 16.7, the Hall voltage V;, will 
yield information about the carrier concentration and the sign of their charge. 
This experimental information is usually given in tables of the Hall coefficient 
Ry = 1/Nqg, where N is the number of conduction carriers per unit volume 
and g is the charge of the carriers. In many solids, both electrons and holes 
participate in the conduction process, and thus the sign of Ry is determined 
by the predominant type. When listing Ry, the convention is that it is negative 
if the predominant type of carrier is the electron, and positive if it is the hole. 
Regardless of the complexities involved in the calculation of Ry when both 
types of carriers are present, it should be clear that R;, could never be positive 
if holes were not a reality. Some examples of the value of the Hall coefficient 
are listed in Table 24-2. 


TABLE 24-2 
Ru 
Solid (m?7/C) 
Lithium — 17 x 10-8 
Sodium — 25 x 107?! 
Beryllium + 24 x 107" 
Zinc + 3x 10°" 
+ 


Cadmium 6 x 107" 
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FIGURE 24-34 

Response of a positive charge q 
of positive mass m to an electric 
field @ resulting in a current den- 


sity J. 
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It can be seen in this table that some well-known metals conduct electricity 
with holes as the predominant charge carrier. 

We conclude this chapter with a comment about the behavior of holes. 
Consider the situation depicted in Fig. 24-32b, where a few electrons have 
been excited into the conduction band of the semiconductor leaving behind 
an equal number of holes at the top of the valence band. As we indicated 
earlier in this section, if additional energy is provided to the charge carriers 
of the semiconductor, such as by the application of an external electric field, 
the electrons in the valence band will move up into the empty states (the 
holes). When this happens, the holes move down from their initial position 
in the valence band. Because we have decided to ignore the electrons in the 
valence band and instead consider only the holes, we conclude that when 
energy is provided to the carriers in the valence band, the holes move down, 
or putting it differently, the energy of the holes increases downward from the 
top of the valence band. This fact will be used in Section 25.2a to calculate 


the hole concentration in the valence band of a semiconductor. 


PROBLEMS 


24.1 In the Kronig-Penney model, the relation be- 
tween the energy E and the momentum p of the 
electrons is given by the relation 


pon + cos yd = cos kd 
where 
_ mE, obd _ 2mE te _ P 
i ae 42 y= (FF) , and k=, 


Show that if E,o = 0, the energy spectrum becomes 
continuous and it is that of the free particle, 
hy2k2 p? 
2m 2m 


Use physical arguments to justify this result. 


24.2 Sketch three different wavefunctions that are 
degenerate (that is, correspond to the same energy) 
with the one of the third level of Fig. 24-21. 


»4.3. The solution of the Kronig Penney model shows 
that the width of the allowed bands increases as the 
energy increases (see Fig. 24-9). Use the arguments 


presented in Section 24-4 concerning the overlap of 
the atomic wavefunctions to explain this result. 


24.4 Use the arguments presented in Section 24-5 
to predict the band structure of (a) Li, and (b) Al. 
(c) Given the results predicted by these simplified 
arguments, how do you explain that aluminum is 
trivalent? 


24.5 Use the concept of band theory to explain the 
following observations: Most insulators are trans- 
parent to visible light, semiconductors are transpar- 
ent to infrared light but opaque to visible light, all 
metals are opaque to light of all wavelengths. 


24.6 The energy gaps of some alkali halides are 
KCl = 7.6 eV, KBr = 6.3 eV, KI = 5.6 eV. Which 
of these are transparent to visible light? At what 
wavelength does each become opaque? 


24.7. The experimentally determined value of the 
Fermi energy in Na is 2.50 eV. In Chapter 23 we 
showed that 


h* 212/3 
Er = — 
F= 5 [SN7*] 


Use the measured value of Er to calculate the effec- 
tive mass of the electrons. Na is monovalent. Its 
atomic weight and density are 22.99 g/mole and 

0.97 g/cm’, respectively. 
(Answer: 1.26 m, where m is the free electron 
mass.) 


24.8 The density of aluminum is 2.70 g/cm? and its 
molecular weight is 26.98 g/mole. (a) Calculate the 
Fermi energy. (b) If the experimental value of Er is 
12 eV, what is the electron effective mass in alu- 

minum? Aluminum is trivalent. 
(Answer: (a) 11.6 eV, (b) 0.97 m, where m is the 
free electron mass.) 


24.9 The experimentally measured specific heat 
of the conduction electrons in beryllium is C, = 
0.54 x 10~4 T calories/mole-K. (a) What is the Fermi 
energy for beryllium? (b) What is the electron effec- 
tive mass in beryllium? The density of beryllium is 
1.86 g/cm? and its molecular weight is 9.01 g/mole. 
(Answer: (a) 15.7 eV, (b) 0.92 m, where m is the 
free electron mass.) 


24.10 For a free electron, the relation between its 
energy and its momentum is given by 


_ 
ET om 


Because p = hk, the relation can be written as 


The dependence of E on k is parabolic. For an elec- 
tron moving in the periodic potential of the lattice 
ions, the relation between E and k is more compli- 
cated. As we have seen, the solution of the Schro- 
dinger equation for an electron moving in a one- 
dimensional periodic potential consisting of equally 
spaced potential wells yields the following relation 
between E and k 


sin yd 


arr, 


+ cos yd = cos kd 
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where 


mEpobd 


P= 12 


is a constant, b is the separation between the wells, 
d is the periodicity of the wells (see Fig. 24-4), and 
y = [2mE/h?]'*. This equation cannot be solved ex- 
plicitly for E(k). It can be solved numerically. That 
is, we pick a value of E, plug it into the equation, 
and find k. The procedure is repeated for other val- 
ues of E. This numerical solution yields a rather in- 
teresting result. There are ranges of E for which k is 
a real number. These ranges are separated by ranges 
of E for which k is imaginary. This happens because 
the left side of the equation is greater than 1 or smaller 
than —1. Because the right side is cos kd, and the 
cosine of a real number varies between 1 and —1,k 
is imaginary for those values of E for which the left 
side of the equation is greater than 1 or less than 
—1. The momentum of a particle cannot be imagi- 
nary; therefore, the electron cannot possess those 
values of E for which k is imaginary. This leads to 
the existence of allowed and forbidden energy bands 
in solids. 

(a) Write a computer program that will yield the 
allowed values of E and the corresponding values 
of k as well as the forbidden values of E (the values 
of E for which k is imaginary). This can be done by 
letting E vary between 0 and 200 eV in small incre- 
ments. It is suggested that in the range 0 to 
50 eV, AE = 0.2 eV, in the range 50 to 200 eV, 
AE = leV. 

(b) Plot E versus k for the first allowed energy 
band. Is the relation between E and k parabolic? 


Data 


5 
P=3T 


d=2A=2x 107m 
m=9.1 x 107-7! kg 
h = 1.05 x 107% J-sec 


24.11 To see how the width of the allowed bands 
depends on the interatomic separation, use the pro- 
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gram of Problem 24.10, with the proper modifica- 
tions, to find the width of the first allowed energy 
band for: (a) d = 1.0 A, (b) d = 1.5 A, (c) d = 
2.0 A. In all three cases assume that the product 
E,ob = 25 eV-A. (d) Do the results agree with the 
qualitative arguments of Problem 24.3? 


24.12 The width of the allowed bands depends also 
on the depth of the potential wells. Verify this with 
the program of Problem 24.10 by finding the width 
of the first allowed energy band for: (a) E,ob = 
50 eV-A, (b) E,ob = 75 eV-A, (c) E,ob = 100 eV-A. 
Assume d = 1A. 
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25.1 INTRODUCTION 


In the previous chapter we saw how band theory permits the classification 
of solids into metals, insulators, and semiconductors. The practical use of the 
first two is as old as civilization. The widespread application of semiconduc- 
tors, on the other hand, goes back only to the 1950s. The consequence has 
been a revolution in electronic technology that in turn has had large socio- 
economic repercussions. It is interesting to note that the fact which triggered 
this revolution was not a better understanding of the physics of semicon- 
ductors. The general model of a semiconductor was developed principally by 
A. H. Wilson and dates back to the 1930s. The reason for the technical rev- 
olution was the advent of refining techniques that have permitted the prep- 
aration of samples of Si and Ge with impurity concentrations of 1 part in 10°. 
Impurity concentrations as low as a few parts per million in most other solids 
are difficult to obtain. 

We will briefly discuss some of these preparation techniques in Chapter 
28. For the present, we will concentrate our attention on the electrical prop- 
erties of semiconductors, which is why they are such useful and interesting 
materials. 


25.2 INTRINSIC SEMICONDUCTORS 


An intrinsic semiconductor is one whose impurity concentrations are so low 
that its electric properties are not affected by them but are solely determined 
by the band structure of the material. 

In Section 22.2 it was stated that both Si and Ge have the tetrahedral 
diamond structure shown in Fig. 22-6. Each atom is surrounded by four 
nearest neighbors so that it is the center of a tetrahedron. It shares its four 
valence electrons equally with its four neighbors, thereby forming strong 
covalent bonds. The structure is so stable because of the strength of the bonds 
that at low temperature very few electrons are able to break loose from a bond 
and migrate in the crystal and participate in conduction processes. In fact, at 
T = 0 K both Ge and Si are insulators. 

Having learned about band theory, we can express this fact in another 
way. A semiconductor is a solid that at T = 0 K has a valence band completely 
filled with electrons, separated by a forbidden energy gap E, from an empty 
conduction band. However, the energy gap is small enough that at reasonable 
temperatures a small number of electrons will be thermally excited across the 
forbidden gap into the conduction band. Once there, these electrons, as well 
as the holes they leave behind in the valence band, can be accelerated by an 
electric field and measurable electrical conduction can take place. 


25.2a Electron and Hole Densities 


It should be clear that the most important factor determining the electrical 
conductivity of semiconductors is the number of electrons per unit volume, 


Londuction band 


Valence band 


N., that are excited into the conduction band at a given temperature T. How 
do we determine this number? Let us choose as the zero of energy the top 
of the valence band as shown in Fig. 25-1. The number of electrons per unit 
volume in the conduction band, N,, can be obtained by multiplying the density 
of states in each incremental energy interval by the probability of occupancy 
of that interval. The result will be the number of occupied states in that 
particular energy interval. We then integrate from the bottom to the top of 
the conduction band. 


top of conduction band 
N, = | (density of states) x (Fermi function) dE (25.1) 
bottom of conduction band 

In Chapter 23 we introduced the concept that the electrons in the con- 
duction band are arranged in energy states according to the Pauli exclusion 
principle and therefore they obey Fermi-Dirac statistics. We must introduce 
the modification derived in Chapter 24, that in order to include the interaction 
of the electrons with the periodic lattice potential, the free electron mass must 
be replaced by the effective mass m?. We can therefore use for the density of 
states 9(E) the expression that we derived in Chapter 23, Eq. 23.27. 


Q(E) = C, E”? (23.27) 
where | 
(2m?2)°'7a9 
C= 5732 (23.28) 


Note that Eq. 23.28 contains the term a’, which is the volume of the solid. 
We now wish to calculate N,, defined as the number of electrons per unit 
volume, and therefore the right side of Eq. 25.1 has to be divided by a*. This 
can be done by deleting a? from the expression for C,. In the derivation of 
Eq. 23.27, E was measured from the bottom of the conduction band. Because 
we have chosen the top of the valence band as E = 0, we must shift the zero 
point by an amount E, by subtracting it from E; therefore, 


g(E) = C.(E — E.)'? 


We substitute this expression for the density of states and Eq. 23.31 for the 
Fermi function into Eq. 25.1 and integrate. There are a couple of difficulties, 
however. Integrals involving the Fermi function are not particularly easy to 
integrate. Often they cannot be integrated analytically. The second difficulty 
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FIGURE 25-1 
Schematic of the conduction and 


valence bands of Si. The top of 
the valence band is chosen as the 
zero energy point; the energy of 
the bottom of the conduction 
band is E,. 
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is that we need to know the width of the band (the upper limit of the integral). 
What saves us is the fact that the Fermi level E; lies near the middle of the 
energy gap (this will be shown later). In the case of both Si and Ge, E, ~ 
1 eV. Therefore, because the bottom of the conduction band is E,/2 above Er, 
we estimate that E — Er = 0.5 eV. At room temperature (T = 300 K) kgI = 
0.025 eV. These quantities introduced into the Fermi function permit us to 
neglect the 1 in the denominator compared with the exponential term, which 
is much larger than 1. For example, for E = E,, that is, for the energy level 
at the very bottom of the conduction band, 


E — E; leV — 0.5 eV - 
= Se = = 10? 
<*P ( kT *P ( 0.025 eV : 


For higher energy values, the exponential term is even greater; therefore 


1 b= & 


The resulting exponential is an easier function to integrate. We see that in 
the low temperature, high energy approximation Fermi-Dirac statistics can 
be approximated by Maxwell-Boltzmann statistics (Chapter 9, Supplement 
9-1). Moreover, because F(E) decreases exponentially with E, the product 
9(E)F(E) will be appreciable only near the bottom of the conduction band 
where E is small. Conversely, the integrand will be negligible above certain 
energies. If we take E, = 1 eV, Er = 0.5 eV, T = 300 K, and we assume a 
band width of 1 eV (typical value), then 


1TeV — 0.5eV 


= _ ~ 1-9 
0.025 eV ) a 


F(Exottom) = exp (~ 


and 


2eV — 0.5eV 
F(Etop) = exp (- ow) = exp (— 60) ~ 10° *° 


Because F(E) at the top of the band is so small, we do not have to know the 
exact width of the band to integrate Eq. 25.1; we simply use ~ as the upper 
limit of integration. Eq. 25.1 is then written as 


se E — Ep 
ae — 1/2 i f 
N. = C. i. € E,)'2 exp ( a ) dE (25.1') 


This can be integrated readily by using a couple of simple tricks. Let us 
multiply the right side of Eq. 25-1’ by a form of unity that leaves the equality 
unchanged, namely, 


(kpT)>? - Ee rt.) 7 
(kg)? exp kT exp kT = ] 
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Eq. 25.1’ can then be written as 


Er -E 0 /F —~ E\'? E-E aE 
N. = C, (kpT)?” exp orm ) i ( LT ‘ exp G LT ; (=) 


If we let 


then 


oO 


E- -—E | 
_ 3/2 ee 1/2 4-x 
N. = C, (kpT) exp ( kT ) ; xe aX 
Notice that when E = E,, x = 0. The integral can be found in standard tables 
and is equal to V 7/2. hereon: the number of electrons per unit volume in 
the conduction band is 


1 (2m:\°° Ee= E ee em 
N, = 4 ES (kgT)°? exp =) (25.3) N, - N. ae | Kal ) 
rt arp 
Example 25-1 Sree 
l (2m? Kal 
The energy gap E, in silicon is 1.1 eV. The average electron effective mass is Ne = 4 ( po 


0.31 m, where m is the free electron mass. Calculate the electron concentration 
in the conduction band of silicon at room temperature, T = 300 K. Assume 
Er = E,/2. 

Solution From Eq. 25.3, 


3/2 
1] 2x0.31x9.1x 107° kg x 1.38 x 107 * J/K x 300 K 
4 (1.05 x 107% J-sec)* a 


Ep be 
x exp iT 


N,. = 4.36 X 107*m7° X exp [- 


N, = 


0.55eV x 1.6 x 10° J/eV 
1.38 X 10-23 J/K x 300K 


N, = 2.6 X 10 per m? 
A typical metal has ~ 1078 free electrons per m’. 


In Chapter 24, Section 24.5, we indicated that the magnitude of E, de- 
termines whether a solid is an insulator or a semiconductor. Equation 25.3 
shows that, at a given temperature, the number of electrons in the conduction 
band decreases exponentially with increasing E,. We can readily see why 
diamond (E, ~ 6 eV) is an insulator whereas Si (E, ~ 1 eV) is a semiconductor. 
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From Eq. 25.3, the ratio of N, in diamond (C) to that in silicon (Si) is (remember 


that Er a ba) 
: 3/2 
7 & ©) (kgT)? exp (- EO) 


N(C) 4\ fa 2kpT 
NASi) 1 (2m? (Si)\* i _ E,(Si) 
i( fear ) aly” eXP | Oh eT 


where m2(C) and m(Si) are the electron effective masses in diamond and 
silicon respectively, E,(C) is the energy gap for diamond and E,(Si) the energy 
gap for Si. If we neglect differences in the effective masses as being much 
smaller than the difference in the exponential terms, the ratio becomes 


ae (- et) 
NAC) _ P\S 2 kel 7 _ 2.5 eV 
NASi) xe (- 1 v) 7 kpT 
2keT 


At room temperature kgT = 0.025 eV; therefore 


N.C) 2.5 eV ; 
= Pee eee _ = 1 =~ 1 44 
NASi) ( 0.025 i) i a 


The density of conduction electrons in diamond is 44 orders of magnitude 
smaller than in Si. In Example 25-1 we found that at room temperature the 
electron density in Si is N, ~ 10'° per m®. We conclude that at room temper- 
ature there are essentially no electrons in the conduction band of pure dia- 
mond, and therefore it is an insulator. 

Let us now calculate the density of holes. In order to calculate N, from 
Eq. 25.3 we needed to know Er. We assumed, without proof, that for an 
intrinsic semiconductor Er = E,/2. We can obtain an expression for N, that 
is independent of Er and at the same time find E; by calculating N, (the 
number of holes per unit volume in the valence band) in a similar manner to 
the calculation for N.. We then set N, = N, and we will be able to obtain Er. 

The probability of having a hole at some energy E in the valence band 
is the same as the probability of the absence of an electron at that same energy; 
thus if F;, is the hole probability, 


F,=1-F,=1- 


Simplifying, we obtain 


and, on dividing the numerator and denominator by the numerator and 
eliminating the resulting negative exponential by reversing the order of E and 
Er, we have 


Notice that because E corresponds to energies in the valence band it is neg- 
ative. Because Er = +E,/2, the argument of the exponential is always positive. 
Moreover, because E, — E >> kgT, we may neglect the 1 in the denominator 
because it is much smaller than the exponential term. We may thus approx- 
imate F,, by the Boltzmann distribution function as we did for the electrons, 
that is, 


_ E = Ep 
F,(E) ~ exp iT | (25.4) 
We know that the holes behave as positive charged particles with energies 
that increase downward from the top of the valence band just as the electron 
energies increase upward from the bottom of the conduction band. We can, 
by analogy to the electron case, write an expression for the density of hole 
states in the valence band. Note from Fig. 25-1 that the E = 0 position is at 
the top of the valence band and therefore g(E) will not be a function of 
(E — E,)"?, as was the case for the electronic states in the conduction band. 
Instead it will be a function of (— E)"? (the minus sign is included because E 
is negative in the valence band and g(E) must be real). 


(2mi,)>" 


gi(E) = Cy (—E)"? where C, = “S535 


(25.5) 


Note that since we are calculating the number of holes per unit volume, we 
have deleted a* from the expression for C;,. It should be noted that the effective 
mass of a hole is not expected to be the same as that of an electron. The 
number of holes per unit volume will be given by the integral 


) P | 

Nie J ~_ A{E) F,(E) dE (25.6) 
We have used —~ as the lower limit of the integration instead of the energy 
of the bottom of the valence band for the same reason as before: The expo- 


nential term decreases rapidly as we move away from E = 0. Substituting 
Eq. 25.5 for g, and Eq. 25.4 for F;, into Eq. 25.6, we obtain 


0 E — Ey 
—_ 1/2 
i. C,(— E)"2 exp 7 dE 


E 0 E \'2 E aE 
Seep | - | (ko? | (- at) a (=) (=) 


Nn 


Nn 
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Letting x = —E/kgT, we write 
” _ Er 3/2 1/2 p-x (_ 
N, = C; exp (kpT) x"* e~* (—dx) 
kel 


The minus sign in the integrand can be eliminated by inverting the limits of 
integration. When this is done, we get the same integral that we had before, 
which was equal to V7/2. Substituting for C, from Eq. 25.5 and rearranging 
terms yields 


_ 1 (2mn 3/2 Er 
N; = ri (=) (kgT)** exp aT (25.7) 


The expression for N;, just as that for N,, depends on Er, which has not been 
determined. However, the product N,N; is independent of the Fermi energy. 


1\ /1\ (2m?\°? (2mp\?? Er — Ey Er 
Ness = (3) (3) (ie) (x) Gm? x er ar) ov (in) 
L/2\ E 
ge 3lay* a7*)3/2 ye 
16 (es) ea eine ( o 
= 1 kel ° % 7 ¥\3/2 E. 
NN; = 5 Ge) (mem;)"'* exp kT 


Note that this product does not depend on the Fermi energy. Because we are 
talking about intrinsic semiconductors, the only source of electrons for the 
conduction band is the valence band. Consequently, N, = Ny, = (N.N;)*?, 
and we may write that 


1\*? kpT 7 % 4 ¥\3/4 Ee 
N, = Nz = (3) (FE, (mem;,)"" exp ( ~ 355) (25.8) 


This gives us N, and N;, in terms of some universal constants, the temperature 
and three intrinsic parameters of the semiconductor: mz, mj, and Eg. 


25.2b The Fermi Level! 


We are now ready to locate the Fermi level of an intrinsic semiconductor. As 
we will see in the next chapter, the position of E; is crucial to the understand- 
ing of semiconductor devices. 

We have found two different expressions for N,, Eq. 25.3 and 25.8. If we 
equate them, 


1 me kpT = %aeq%\ 3/4 Ee 
(5) es i ie 


2me - 3/2 Er — Ee 
(ze) - ox ( ksT 


N, = N, exp ( a 


where 


For an intrinsic 
semiconductor 


N . = IN fy 
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Upon simplification, we get 


E Er -E 
% y7%*\3/4 sul a *\3/2 F g 
(mpm?) exp ( er) (me)"’* exp ( LT 


which can be rewritten as 


Er _ (mimi a Es 
“Pet Vm) Por 


Taking the natural log of both sides, we obtain 


x 
Ep = “ + = ksT In = (25.9) _— Is : 
It is seen in this equation that at T = 0K, Er = E,/2, which was our original 
assumption. For T > 0K, E; is still equal to E,/2 if mz = mj, because In 1 = 0. 
But even if mi + mj, Er ~ E,/2 at ordinary temperatures unless the two 
masses differ greatly: This is because In x is a slowly varying function of x. 
In Si and Ge, m3 ~ mj; thus Er ~ E,/2. 

The concept of the Fermi energy was first introduced in connection with 
electrons in a three-dimensional potential well; we defined Er as the demar- 
cation line between filled and empty states. This could be done at T = 0 K. 
For temperatures greater than 0 K such a separation was not clear cut. Er had 
to be redefined. We did this (see Section 23.3e) by means of the Fermi-Dirac 
distribution, and we saw that E- was that energy for which the probability 
of occupation was 3. The fact that in the case of a semiconductor Ey lies in 
the gap, does not imply that we have electrons in the gap. This would con- 
tradict the results of band theory. The probability that an energy level in the 
gap is occupied can be finite without having electrons in it. We have seen 
that the number of electrons in an energy interval dE is given by the density 
of available energy states in that dE multiplied by the probability that those 
states be occupied (i.e., the Fermi-Dirac function). In the gap the Fermi-Dirac 
function is finite, but the density of states is zero. There are no available 
states, and hence no electrons. 

We can see that it makes sense to have Er = E,/2. Ina metal at T = 0K 
all the electrons are in energy states below Er. As the temperature is increased, 
electrons can be excited to higher energy states. However, we saw in Chapter 
23 that only those electrons near Er have a reasonable chance of being excited. 
Physically, we can say that Er is the reference energy level from which charge 
carriers are more likely to be excited. With this interpretation in mind, let us 
turn our attention to semiconductors. When an electron is excited from the 
top of the valence band to the bottom of the conduction band, it leaves behind 
a hole; an amount of energy equal to E, is needed for the transition to take 
place. We could look at this process as a one-particle process (an electron 
making the transition) or, having agreed to treat holes as proper charged 
particles, we could look at it as a two-particle process (a simultaneous exci- 
tation of an electron into the conduction band and of a hole into the valence 
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band). We assume that each particle needs an amount of energy E,/2 to make 
the transition. The excitation can be viewed as an upward transition of an 
elecctron from the middle of the energy gap and a downward transition of a 
hole from the same reference point, remembering that the energy of holes 
increases downward. This is illustrated schematically in Fig. 25-2. 


25.3 EXTRINSIC OR IMPURITY SEMICONDUCTORS 


The electrical properties of a semiconductor can be changed drastically by 
adding minute amounts of suitable impurities to the pure semiconductor 
crystal. Thus, the addition of 1 atom of boron to 10° atoms of Si raises the 
conductivity of the sample by 10° at room temperature. The most commonly 
used impurities are elements of group V (P (phosphorous), As (arsenic), Sb 
(antimony), . . .) and group III (B (boron), Al (aluminum), Ga (gallium), . . .) 
of the periodic table (see Section 21.7b). The reasons for the choice follow. 


25.3a Donor and Acceptor Energy Levels 


Because the impurity concentration is usually small, we may suspect that the 
lattice structure is hardly changed from that of the pure crystal. Our suspicion 
is confirmed by X-ray studies of the lattice structure. The same studies show 
that the impurity atoms enter the crystal lattice substitutionally, that is, the 
impurity atom replaces a lattice atom. Figure 25-3 illustrates a substitution of 
a phosphorous atom, which has five electrons in its outer shell, into a planar 
representation of a lattice of silicon atoms, which have four valence electrons 
each. 

Let us now consider what happens when an atom from group V replaces 
a silicon atom in the lattice structure. We know that each silicon atom is 
normally bound by four covalent bonds with its four nearest neighbors; that 
is, it shares its four valence electrons with them. When the group V atom 
replaces a silicon atom, it will use four of its own electrons for the covalent 
bonding. There remains, however, an extra electron. This electron (we will 
show later) will not be very tightly bound to its parent nucleus (~ 107 eV), 
and consequently the impurity atom can be easily ionized and the extra elec- 
tron will be free to move through the crystal lattice. This loosely bound 
electron can contribute to conduction processes. We can rephrase this state- 
ment in the language of band theory as follows. The extra electron from the 
impurity atom at T = 0 K occupies an energy level Ep that lies ~ 107? eV 
below the conduction band; that is, it takes only about 10~? eV to ionize the 
impurity atom (see Fig. 25-4). As the temperature is raised, this electron can 
be excited into the conduction band where it can contribute to electrical con- 
duction. Such electrons are said to be donated by the impurity atom, and for 


Conduction band 


A Valence band A 


FIGURE 25-2 

In an intrinsic semiconductor, the 
Fermi level is located approxi- 
mately in the middle of the for- 
bidden energy gap between the 
valence and the conduction 
bands. An electron making a 
transition from the valence band 
into the conduction band leaves 
behind a hole(1). The transition 
can also be looked at as the si- 
multaneous upward transition of 
an electron into the conduction 
band and a downward transition 
of a hole into the valence band, 
both originating at the Fermi 
level(2). 


FIGURE 25-3 
Planar representation of the sil- 
icon lattice structure where a 
phosphorous atom has taken the 
place of a silicon atom. After us- 
ing four of its five valence elec- 
trons to complete the four co- 
valent bonds, the phosphorous 
atom is left with an extra electron 
that can be easily freed and thus 
can contribute to electrical con- 
duction. 
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this reason the energy level Ep is called a donor level. Note that when this 
donor atom is ionized, no hole is created in the valence band. 

We should emphasize that these donor electrons provided to the con- 
duction band are in excess of those which may have been excited across the 
energy gap from the valence band. This implies that we now have more 
negative charge carriers (electrons in the conduction band) than positive ones 
(holes in the valence band). For this reason, a semiconductor doped (that is, 
into which impurities have been introduced) with donor impurities is called 
an n-type semiconductor. 

Let us backtrack a little and try to calculate the energy with which the 
extra electron is bound to the impurity atom. Equivalently, we want to see 
how far below the bottom of the conduction band Ep lies. We can obtain an 
estimate that is in fairly good agreement with the experimentally measured 
value by using a rather crude model. According to this model, the excess 
electron is held to the impurity atom by a net charge of +e (the net charge 
resulting from the + Ze charge of the nucleus shielded by the remaining 
Z—1 electrons). The problem is reduced to a hydrogen-like atom. There are, 
however, two important modifications that must be made. In the isolated 
hydrogen atom, the electron orbits the nucleus in free space. Here the excess 
electron interacts with the impurity atom while moving in a medium of silicon 
atoms. This implies two things: 


1. The dielectric constant x (Section 14.7) of the medium must replace the 
dielectric constant of free space, k = 1. 


2. We must use the effective electron mass m? rather than the free electron 
mass. 


Keeping these two facts in mind, we can easily evaluate the energy that 
binds the extra electron. Both the Bohr model and the quantum mechanical 
model of the hydrogen atom give for the binding energy of the electron (Eqs. 
18.16 and 21.9). 


e*m 
= ——— = 13.6eV 
8(KEo)2n2 ° 
In silicon kép = 12 €&) and mz = 0.31 mM. Consequently, 


0.31 
(12)° 


E = (13.6 eV) 


I 


E = 0.029 eV 


This is the energy necessary to free the excess electron from the impurity 
atoms, or in the language of band theory, we can say that the impurity energy 
level Ep lies 0.029 eV below the bottom of the conduction band. For the 
purpose of comparison, the thermal energy at room temperature is kgT = 
0.025 eV. This means that at ordinary temperatures quite a few of these 
impurities can be ionized; that is, the electrons will be in the conduction band. 


Conduction band 


E, — Ep ~ 10-2 eV 


Valence band 


FIGURE 25-4 


At T = 0 K the extra electron 
from the phosphorous atom of 
Fig. 25-3 occupies an energy level 
Ep (donor level) that lies close to 
the bottom of the conduction 
band. This electron can easily 
jump into the conduction band 
when provided with thermal en- 


ergy. 
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The experimentally measured values of Ep agree quite well with the value 
found here. | 

When Si is doped with impurity atoms from group III, it is again found 
that the impurity atom takes the place of a silicon atom in the crystal lattice. 
These impurities, being trivalent, cannot complete the tetravalent bond scheme. 
A vacancy (a hole) is created. At T = 0 K this vacancy remains localized; that 
is, it stays with the impurity atom. However, as T is raised, electrons from 
adjacent Si atoms may jump into the vacancy of the impurity atom. The 
vacancy can now migrate through the crystal. Just as before, we can rephrase 
the situation in band theory terminology: An energy level E, lies above the 
valence band. At T = 0 K this level is empty, and the valence band is full. 
As thermal energy becomes available, electrons from the valence band can 
jump into the empty impurity levels, thus creating mobile holes in the valence 
band. Calculations similar to the one for pentavalent impurities, as well as 
experimental results, show that E, lies relatively close to the top of the valence 
band (between 0.04 and 0.1 eV in Si and ~ 0.01 eV in Ge) (see Fig. 25-5). 
Because these impurity atoms may accept electrons, they are called acceptor 
impurities, and E, is called an acceptor level. The holes created by the presence 
of the acceptor impurities are in addition to those created by thermal exci- 
tations across the energy gap. Thus a semiconductor doped with acceptor 
impurities has more positive charge carriers (holes) than negative ones (elec- 
trons). These impurity semiconductors are called p-type semiconductors. 


25.3b Carrier Density and 
Fermi Level in Impurity Semiconductors 


We have already indicated in the preceding section that the presence of im- 
purity states will significantly affect the number of electrons in the conduction 
band or holes in the valence band. They will also very dramatically affect the 
position of the Fermi level. For simplicity, we will discuss the case of a semi- 
conductor with one type of impurity. 

Let us consider an n-type semiconductor and try to answer the same 
question as before: What is the concentration of electrons in the conduction 
band N, and the concentration of holes in the valence band N,, at a given 
temperature? | 

When we derived the expressions for N, and N, for the intrinsic semi- 
conductor, Eqs. 25.3 and 25.7, we simply multiplied the density of states 
9(E)dE in an incremental energy interval by the probability of occupancy in 
that interval, that is, by the Fermi-Dirac function F(E). Neither g(E) nor F(E) 
are affected by the introduction of impurities; therefore, the expression for 
N, and N,, for doped semiconductors will still be given by Eqs. 25.3 and 25.7. 
However, although the expressions for N, and N, are the same as those for 
the intrinsic semiconductor, their numerical value is not the same. The reason 
is that the position of the Fermi level Er in the doped semiconductor is not 


Conduction band 
E= Ey 


ba 10-4 ev 
Valence band 
FIGURE 25-5 


The introduction of trivalent at- 
oms in the silicon lattice struc- 
ture of Fig. 25-3 gives rise to an 
empty energy level E, (acceptor 
level) near the top of the valence 
band. With little thermal energy, 
electrons from the valence band 
can jump into this level, thus cre- 
ating holes in the valence band. 
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the same as in the intrinsic semiconductor. The way we obtained E, in the 
intrinsic case was by equating N, and N,,. This is fine if the only source of 
electrons for the conduction band is the valence band. However, this is not 
true for impurity semiconductors. In an n-type semiconductor electrons can 
be excited into the conduction band from the donor levels as well as from 
the valence band. How do we determine the Fermi level in this case? The 
condition we use is that the crystal must be electrically neutral. More explicitly, 
the number of electrons in the conduction band N, must be equal to the 
number of holes in the valence band plus the number of donor impurity 
atoms per unit volume that have been ionized, Nj. This condition is written 
as For an n-type semiconductor 


N, = N, + No (25.10) N. = N, + Na 


Eq. 25.10 is often referred to as the neutrality equation. Our first task is to find 
Nj. This can be done in the following way. Suppose there are Np donor 
impurity atoms per unit volume. There will be Np electrons in the energy 
level Ep that may make the transition to the conduction band. If we multiply 
Np by the probability that an electron not be at Ep, we should have the number 
of ionized donor impurities Nj. 


Ni = Np (1 — F(Ep)) (25.11) 


If we substitute Eq. 25.3 for N., Eq. 25.7 for N,, and Eq. 25.11 for N& in Eq. 
25.10, we get 


1 
= N, exp | — a + Np | 1 - (25.12) 
eA ex Fp Er +1 
PV kT 
where 
1 /2m* kpT\*? 
ee 4 ( ha ) 
and (25.13) 
_ 1 2m}, kpT le 
na 4 ( ha ) 


Eq. 25.12 must be solved to obtain Er. If we could solve it analytically, we 
would obtain E; in terms of T and the properties of the semiconductor, that 
is, E, and Np. Unfortunately, it cannot be solved analytically, so it is solved 
numerically. The results from such calculations are shown in Fig. 25-6. The 
figure shows the variations in E-; with temperature for different impurity 
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concentrations. The curves 1, 2, 3 correspond to increasing values of donor 
impurities Np. The dashed line represents the value of Er for the intrinsic 
semiconductor with the assumption that mj, > mz. As indicated in Section 
25.2b, Eq. 25.9, for the intrinsic semiconductor, at low temperatures E; ~ 
E,/2. As it is raised to high temperatures, the second term of Eq. 25.9 becomes 
significant and E; increases (because In mj/m? is positive) from its low tem- 
perature value. This is the reason for the upward curvature of the dashed 
line of Fig. 25-6. The general features of the results are: 


1. At low temperatures the Fermi level lies halfway between Ep and the 
bottom of the conduction band; that is, 


_ by Ep 
2 
As the temperature increases, Er approaches the value for the intrinsic 


semiconductor. Note that “low T”’ is a relative expression; this statement 
can hold at room temperature, depending on the type of semiconductor. 


Er (25.14) 


2. The transition between these two regimes occurs at higher temperatures 
as the impurity concentration is increased. 


We have found the location of Er with a mathematical argument. Now 
that we have the answer, we can understand the physical origin of the result. 
We know that the position of the donor level Ep from the bottom of the 
conduction band (the ionization energy of the impurities) is ~ 0.03 eV for Si. 
We also know that E, = 1.1 eV. By now, we are also familiar with the fact 
that the probability of a transition across an energy gap E,,, is given by (see 
Eq. 25.8) 


E 
is sateen 
P ~ exp vad 
What this means is that the probability that an electron will jump from the 
donor level to the conduction band across a small energy gap, E, — Ep = 


0.03 eV, is many orders of magnitude greater than the probability of making 
a transition from the valence band to the conduction band across a large 


FIGURE 25-6 

The Fermi level in an n-type 
semiconductor as a function of 
temperature (solid lines). At low 
temperatures, E; lies halfway be- 
tween the donor level Ep and the 
bottom of the conduction band 
E,. At high temperatures, E; ap- 
proaches the value of the intrin- 
sic semiconductor (dashed line). 
The three lines labeled 1, 2, and 
3 correspond to increasing val- 
ues of the impurity concentra- 
tion. 
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energy gap E, = 1.1 eV. We may write the ratio of these probabilities as 


exp | — 
Probability from the impurity level _ 2kgT 
Probability from the valence band E. 
exp | — 
2kpT 


- _ 0.03 | 
ee ee ee BA 
7 ~ &*P | ORT 


At room temperature kgT = 0.025 eV; this ratio becomes exp (21.4) ~ 10”. 
Even though there are many more electrons in the valence band than in the 
donor levels (~ 1078 valence electrons/m°’ compared with ~ 10%? impurity 
electrons/m*, assuming an impurity concentration of one part per million), 
the fact that the probability of transition from Ep is so much greater allows 
us to conclude that at room temperature practically all the electrons in the 
conduction band come from the impurity levels. This means that at low 
temperatures, the semiconductor behaves as an intrinsic semiconductor with 
an energy gap E, — Ep. Thus we can expect that just as in the case of the 
true intrinsic semiconductors, Er will be in the middle of that gap. This is 
what the solution of the neutrality equation yielded. At high temperature, all 
the impurities will be ionized and, because we have many more electrons in 
the valence band than in the impurity levels, the number of electrons in the 
conduction band will be much greater than Np. The contribution of the im- 
purities will thus be negligible, and the material will behave like a real intrinsic 
semiconductor; we may expect that Er will be the same as that of the intrinsic 
semiconductor. This is in fact what happens. Clearly, the larger the impurity 
concentration, the higher the temperature at which the contribution from the 
electrons excited from the valence band becomes more significant than that 
of the impurity electrons. 

It can be inferred from our discussion what will happen in the p-type 
semiconductor. Acceptor levels of energy E, exist very close to the top of the 
valence band. At low temperatures transitions from the valence band to these 
levels will dominate the transitions to the conduction band. The impurity 
levels accept electrons from the valence band, thereby permitting hole con- 
duction. We may therefore expect Er to be half way between the top of the 
valence band and E,. As the temperature increases, these impurity levels will 
become filled, and transitions to the conduction band become the most im- 
portant source of charge carriers. The material then behaves like an intrinsic 
semiconductor. We can argue that Er will tend toward the intrinsic semicon- 
ductor value. This is in fact what happens. A plot of Er versus T for a p-type 
semiconductor is shown in Fig. 25-7. Just as in Fig. 25-6, the curves 1, 2, 3 
correspond to increasing impurity concentration. The reason for the upward 
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curvature of the dashed line is the same as the explanation given in connection 
with Fig. 25-6. 

We conclude this section with one important observation that should be 
obvious by now. We have indicated that one of the effects of the impurities 
was to break the equality that existed in the intrinsic semiconductor between 
electrons and holes. In the n-type semiconductor, owing to the contribution 
from the donor states, there are more electrons in the conduction band than 
holes in the valence band. Moreover, we have shown that at low temperatures 
most of the electrons in the conduction band come from the impurity levels. 
This in turn means that in the n-type semiconductor electrons are much more 
abundant than holes (see Example 25-2). For this reason, in an n-type semi- 
conductor, the electrons are called the majority carriers, and the holes are 
named minority carriers. Obviously, in the p-type semiconductor the situation 
is reversed. 


Example 25-2 


A sample of Si is doped with phosphorous. The donor impurity level lies 
0.045 eV below the bottom of the conduction band. At T = 300 K, E; is 
0.010 eV above the donor level. Calculate (a) the impurity concentration, (b) 
the number of ionized impurities, (c) the free electron concentration, and (d) 
the hole concentration. (For Si, E, = 1.100 eV, mz = 0.31 m, m;, = 0.38 m). 


Solution 


(a) From Eq. 25.12, 
Ey = Ep Er 
— —&__—] = N, eee 
N, exp ( kT ) exp ( a 


(25.12) 


FIGURE 25-7 

The Fermi level in a p-type semi- 
conductor as a function of tem- 
perature (solid lines). At low 
temperatures, E; is halfway be- 
tween the acceptor level E, and 
the top of the valence band. At 
high temperatures, FE, ap- 
proaches the value of the intrin- 
sic semiconductor (dashed line). 
The three lines labeled 1, 2, and 
3 correspond to increasing val- 
ues of the impurity concentra- 
tion. 
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From Eq. 25.13, 
yy, = 2 | 2d ket we 
4| Wa 


_ (2 x 0.31 x 9.1 x 1071 kg x 1.38 x 10°” J/K x 300 K)*? 
7 4 x (1.052 x 107% J?-sec? x 2)?” 


= 4.39 x 1074 m-° 


and 


wy, = 2 [2th ket - 
4| fen 


7 4 xX (1.052 x 10-8 J?-sec* x amy? 


= 5.95 x 1074 m~? 


Substituting for N., N,, Er, and Ep into Eq. 25-12 (keep in mind that 
energies are measured from the top of the valence band), we get 


449% Wee (- (1.100 — 1.065) ) 


0.025 eV 
1.065 eV 
es 24 m,-3 or a 
5.95 x 10** m exp ( 0.025 wv) 
1 
+ Np] 1 - 


oo (_ onl eV) , 

P\~ 0.025 eV 
1.08 x 1024 m-3 = 1.88 x 10®m~3 + Np (0.40) 
Np = 2.7 X 1074 m~° 


(b) The number of ionized impurities is given by the second term of the 
right side of Eq. 25.12 


1 
Ns = Np = 
ex ee + 1 
P\ kT 
7 1 
Ns = 2.7 X 1077 m3] 1 —- 


oo (_ 0:010€V) | 
P\~ 0.025 eV 


1.08 x 1074 m~* 
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(c) The free electron concentration is equal to the left side of Eq. 25.12, 
which from part (a) of this example is 


N. = 1.08 x 1074 m~3 


(d) The hole density is found in the first term of the right side of Eq. 
25-12 


N, = 1.88 x 10° m-? 


29.4 ELECTRICAL CONDUCTIVITY OF SEMICONDUCTORS 


In Chapter 23 on free electron theories we saw that for a conductor, except 
at very low temperatures, the electrical conductivity o decreased with in- 
creasing temperature as 0 « T~'. For an intrinsic semiconductor, the con- 
ductivity increases exponentially with increasing temperature, 


Oo = Oo exp(—a/T) 


where a is a constant. This is illustrated schematically in Fig. 25-8 where a 
plot of the natural log of a versus the inverse of the temperature yields a 
straight-line relationship. 

In the case of a doped semiconductor, the same exponential behavior is 
observed. However, the constant a has two different values. This is imme- 
diately obvious if we look at the graph of Fig. 25-9, which shows the exper- 
imentally observed variation of o with temperature. The graph clearly shows 
that at low T’s and high T’s In a varies linearly with the inverse of the tem- 
perature. The slope is much greater for high T’s than for low T’s. 

We can understand these results if we keep in mind our discussion of 
the carrier concentration and the position of the Fermi level. In Chapter 23 
we found that | 


(23.14) 


where N was the number of free charge carriers (at that time electrons), q 
was the charge of the carriers, 7 was the time between collisions with ions, 
and m was the mass of the carriers (at that time the true mass of the electron). 
Because we now have two types of charge carriers, electrons and holes, which 
contribute to the electrical conduction, the expression for o must be modified 


to 
| N, 2 N 
oe lee Ea Fe | 


% 


25.1 
me ni (25.15) 


where 7, and 7; are the mean times between collisions with ions for electrons 
and holes, respectively, mz and mj their effective masses, and |e] is the mag- 
nitude of the electron charge because conductivity does not distinguish be- 
tween types of carriers. 


ee, 
FIGURE 25-8 

Experimentally observed tem- 
perature dependence of the elec- 
trical conductivity of an intrinsic 
semiconductor. 


+ / 
FIGURE 25-9 
Experimentally observed tem- 
perature dependence of the elec- 
trical conductivity of an impurity 
semiconductor. 
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Let us first consider the case of intrinsic semiconductors. In this case 
N. = N, and N, can be factored from Eq. 25.15 


ip Be 4st 
o = |e? N. E 2 | (25.16) 
Unlike the case of a metal where N, was constant and the temperature de- 
pendence of o was hidden solely in the time between collisions with ions 
(7 « T~+), in semiconductors both N, and 7 vary with T. Calculation of 7 for 
silicon and germanium has shown that 7 « T~°*. The carrier concentration 
N., on the other hand, depends exponentially on T, and it is therefore much 
more sensitive to changes in temperature. If we substitute N, of Eq. 25.8 in 
Eq. 25.16, we obtain 


E 7, Th, 
_ 3/2 ae, eee a ae : 25.17 
ao = constant T°? exp sits | (% =.) ( ) 


Using the T~°? power law mentioned previously for 7, the multiplicative 
temperature terms cancel and the conductivity may be written as 


E 
o = constant exp | - 5 | (25.18) 
B 


where the constant is now modified by the sum of the reciprocal effective 
masses. A plot of the In a versus T~* yields a straight line with slope — E,/2kg. 
The experiment (see Fig. 25-8) confirms this prediction. Moreover, the value 
of E, obtained from conductivity measurements agrees with that found by 
other means such as optical absorption, which we will discuss later. 

Let us now examine impurity semiconductors. We will consider n-type 
semiconductors. Similar arguments will apply to the p-type. In discussing the 
position of the Fermi level in n-type semiconductors, we indicated that at 
high temperatures, when all the impurities have been ionized, their contri- 
bution to N, becomes negligible compared with the contributions from exci- 
tations from the valence band. As a result, the crystal behaves like an intrinsic 
semiconductor. What we have shown about the conductivity of the intrinsic 
semiconductor holds here too: The In o versus T~? will be a straight line with 
slope — E,/2kg. 

At low temperatures the situation is quite different. There, most of the 
electrons in the conduction band originate from the impurity levels, and as 
a result N, >> N,. We can ignore the contribution of the holes to the con- 
ductivity, and Eq. 25.15 can be approximated as 


ee Nee 

0 nt 

If we substitute Eq. 25.3 for N, and put in the temperature dependence of 7 
as T°, we obtain 

(E, — Ef) 


kpT 


ao = constant exp [- 


AAT 


For an intrinsic semiconduc- 
tor at all T’s and for a doped 
semiconductor at high T’s. 
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We have shown previously (Fig. 25-6) that at low temperature Er lies halfway 
between the donor levels and the bottom of the conduction band; that is, 
Er = (Eg + Ep)/2, Eq. 25.14. Using this result, we may write the expression 
for the conductivity as 


(E, 7 Ep) 


aca (25.19) 


a = constant exp [- 
Thus, at low temperatures In o will vary linearly with T~* and the slope will 
be —(E, — Ep)/2kz. We recognize E, — Ep as the impurity ionization energy 
that we know is much smaller than the gap energy E,. Therefore, the slope 
of the curve at low temperatures-should be much smaller than at high T’s. 
The expected behavior of o is shown in Fig. 25-10. The actual experimental 
data, Fig. 25-9 shows a dip between the high and the low temperature regimes. 
At intermediate temperatures, most of the impurities will have been ionized, 
and consequently there will be no change in their contribution to N. with 
variations in T. The temperatures are still too low (kgT << E,) to have a 
significant number of electrons excited from the valence band across the 
energy gap. As a consequence, the temperature dependence of o will be 
controlled by the collision time 7. Because 7 decreases with increasing T 
(r « T~%?), we would expect a decrease in a as T is raised. This is the expla- 
nation for the dip in Fig. 25-9. 


25.5 PHOTOCONDUCTIVITY 


We conclude this chapter with some brief remarks about the behavior of a 
semiconductor when it is exposed to electromagnetic radiation. Being by now 
familiar with the concept of photons, one can immediately think of a simple 
technique to measure the energy gap of a semiconductor: Illuminate the semi- 
conductor with photons of different frequencies and observe the ability of the 
sample to absorb them. 

The experimental arrangement is rather simple. Light from a device in 
which one can select a given wavelength of light, called a monochromator, 
strikes a thin slab of material under study. The amount of radiation trans- 
mitted through the sample is then measured by means of a light meter (Fig. 
25-11). The amount absorbed is found by subtracting the intensity of the 
transmitted beam from that of the incident beam. The experiment is repeated 
for different wavelengths to find the dependence of the absorption on the 
energy of the photons. The experimental results for insulators and semicon- 
ductors show negligible absorption for long wavelengths (small v) and then 
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FIGURE 25-10 
Theoretical temperature depen- 


dence of the electrical conductiv- 
ity of an impurity semiconduc- 


tor. 


For an n-type semiconductor 


at low T's 


‘ Eo ae En: 
a exp ( — — —) 


FIGURE 25-11 
Schematic of the setup used to 


measure the absorbtion of elec- 
tromagnetic radiation by a semi- 


conductor. 


a sudden rise, which is called the absorption edge (see Fig. 25-12). The wave- 
length A, at which the abrupt change takes place depends on the material 
under consideration. For example, for Si, A, ~ 1.1 X 107° m (infrared). 

The results can be easily explained in terms of the band model of the 
semiconductor. If the energy of the incident photons hvis less than the energy 
of the gap E,, the electrons will not be able to be excited from the valence 
band into the conduction band; the photons will not be absorbed by the 
electrons, and the radiation will go through the sample. If, however, hv = 
E,, electrons will be excited across the gap as a result of the absorption of a 
photon. Notice that unlike the case of absorption by isolated atoms, where 
hy must be exactly equal to the energy difference between atomic energy 
levels, in this situation hv can be greater than E,. The reason is simple: There 
are plenty of empty energy levels in the conduction band. This explanation 
can be easily checked by equating hv to E, for the wavelength of the absorption 
edge in silicon. 


he _ (6.63 x 10~* J-sec) (3 x_10° m/sec) 


een 7 1.1 x 10-¢m 


1.8 x 10-%J =1.1eV 


I 


This agrees well with the value obtained from the temperature dependence 
of the electrical conductivity. 

A useful application of the property of transmission at wavelengths above 
the absorption edge is the examination of the perfection of the interior of 
large crystals of silicon. While visible light is blocked, infrared is transmitted 
and can be observed with a suitable detector. 

Because optical absorption creates extra electrons and holes, the electrical 
conductivity of the semiconductor will be enhanced when illuminated with 
radiation of the proper frequency. This effect is called photoconductivity. It has 
a large number of practical applications, such as automatic controls for night- 
lights and light meters in cameras. When a light beam is aimed at the pho- 
toconductor, current can be made to flow in a circuit. Moving objects between 
the light source and the photoconductor will interrupt the current and thereby 
be detected. These systems are called photoelectric cells. 

One of the most familiar applications of photoconductivity is the process 
of xerography. A high resistance semiconductor, usually selenium, is applied 
as a coating to a metal surface. The metal serves as the plate, which is often 
in a cylindrical form. Ionization of the air near the surface by a high intensity 
light of suitable wavelength puts a charge on the exposed surface of the 
selenium coating. Selenium has a resistivity in the dark of the order of 10 
ohm-m, so the charge stays on the surface. However, when a portion of the 
surface is illuminated, the photoconductivity causes the charge at that part 
to leak through to the metal base plate, leaving that region of the selenium 
uncharged. Thus the image of a printed page projected on the selenium 
surface would discharge all parts except those on which the black printing 
was projected. A dark resin powder is then dusted on to the selenium surface, 
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FIGURE 25-12 

Experimentally observed absorb- 
tion of electromagnetic radiation 
by a semiconductor as a function 
of the wavelength of the radia- 
tion. Note the sharp change in 
absorbtion at A,. 
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and it clings to the charged region, that is, that onto which the printing was 
projected. This powder is then transferred to a piece of paper and fused by 


the application of heat. 


PROBLEMS 


25.1 The band gap in pure germanium is E, = 
0.67 eV. (a) Calculate the number of electrons per 
unit volume in the conduction band at 250 K, 300 K, 
and at 350 K. (b) Do the same for silicon assuming 
FE, = 1.1 eV. The effective mass of the electrons in 
germanium is 0.12 m and in silicon 0.31 m, where m 
is the free electron mass. 


25.2 Suppose that the effective mass of holes in a 
material is four times that of electrons. At what tem- 
perature would the Fermi level be shifted by 10% 
from the middle of the forbidden energy gap? Let 
EF, = 1levV. 

(Answer: 557 K.) 


25.3 The energy gap in germanium is 0.67 eV. The 

electron and the hole effective masses are 0.12 m and 

0.23 m, respectively, where m is the free electron 

mass. Calculate (a) the Fermi energy, (b) the electron 
density, and (c) the hole density, at T = 300 K. 

(Answer: (a) 0.348 eV, (b) 4.1 x 10’ m~°, 

(c) 4.1 * 107° m~?.) 


25.4 In the crude model used in Section 25.3a to 
calculate the binding energy of the excess electron 
to the donor impurity atom, we assumed that the 
electron moves in a medium of silicon atoms that 
screen the donor impurity ions; that is, we use the 
dielectric constant x = 12 of silicon. (a) Justify this 
assumption by calculating the radius of the Bohr 
orbit of the excess electron (see Eq. 18.14), (b) Esti- 
mate the number of silicon atoms in a sphere with 
that radius. The nearest neighbor separation in sil- 
icon is 2.34 A. | 

(Answer: (a) 20.4 A, (b) 2700 atoms.) 


25.5 (a) Starting with Eq. 25.3 and 25.7, show that 
the product N,N; is a constant; that is, it is not af- 
fected by the introduction of impurities in the pure 
semiconductor. (b) The number of electrons in the 
conduction band can be greatly increased by the in- 


troduction of donor impurities. These two facts im- 
ply that the number of holes is greatly reduced when 
the semiconductor is doped with donor impurities. 
Explain the physical reason for the reduction in hole 
concentration. 


25.6 Consider a sample of germanium doped with 
phosphorous. Assume that the excess electron re- 
volves around the impurity ion P* in a hydrogen- 
like orbit. Calculate (a) the ionization energy of the 
excess electron, (b) the radius of the orbit (see Eq. 
18.14). The dielectric constant « in germanium is 16 
and the electron effective mass is 0.12 m. 
(Answer: (a) 6.5 X 1073 eV, (b) 70 A.) 
25.7 Silicon is doped with an acceptor type im- 
purity such as boron or gallium. The impurity con- 
centration is N, atoms per unit volume. Write the 
neutrality equation, similar to Eq. 25.12, from which 
the Fermi energy can be determined. 


25.8 Germanium (E, = 0.67 eV) is doped with gal- 
lium. The acceptor impurity level lies 0.011 eV above 
the top of the valence band. (a) What is the impurity 


_ level concentration if E;, at room temperature (T = 


300 K), coincides with the acceptor level? (b) Cal- 
culate the fraction of ionized impurities (mz? = 
0.12 m, m;, = 0.23 m). 

(Answer: (a) 3.6 x 1074 m~%, (b) 50%.) 


25.9 Referring to the situation described in Prob- 

lem 25.8, (a) calculate the concentration of holes and 

electrons, (b) show that the product N,N; is the same 
as the one found in Problem 25.3. 

(Answer: (a) N, = 1.8 x 1074 m~%, 

N. = 9.0 X 103% m~3.) 


25.19 (a) Show that the most probable value of the 
energy for an electron in the conduction band of an 
intrinsic semiconductor is $ksT above the bottom of 
the conduction band. (b) What is the average energy 


of an electron in the conduction band of an intrinsic 
semiconductor? 


25.11 In Table 25-1 we list the conductivity of pure 
germanium at different temperatures. (a) Make a 
plot of In o versus 1/T. (b) Determine E, for ger- 
manium. 


TABLE 25-1 
Problem 25-11 


o (Q-1m-?) T(K) 
2 300 

13 350 

52 400 

153 450 


362 500 


25.12 The conductivity of pure germanium in- 
creases by 50% when the temperature is increased 
from 20°C to 30°C. (a) What is the energy gap E, 
between the conduction and the valence bands of 
germanium? (b) For silicon E, = 1.1 eV, what is the 
percentage charge in the conductivity for the same 
temperature change? 

(Answer: (a) 0.62 eV, (b) 150%.) 


25.13 A certain intrinsic semiconductor has a band 
gap E, = 0.2 eV. Measurement shows that it has a 
resistivity at room temperature (300 K) of 0.3 Q-m. 
What would you predict its resistivity to be 
at 350 K? 

(Answer: 0.17 Q-m.) 


25.14 (a) What is the relative number of electrons 
N. and holes N;, in a doped semiconductor for which 
the electrical conductivity o (see Eq. 25.15) is a min- 
imum at a given temperature? (b) What is the ratio 
N-./N; if the scattering time 7 is the same for electrons 
and holes and m?/mj = 1.5? (Hint: The product N,N,, 
at a given temperature is a constant, see Problem 
25.5). | 

(Answer: (a) N./N, = Tt, me/t. my, (b) 1.5.) 


25.15 The resistivity of a sample of n-type silicon 
at 300 K is 9 x 107? O-m and its Hall coefficient is 
3.9 x 10~* m°/C. (a) Assuming that electrical con- 
duction is due solely to electrons, what is the free 
electron concentration? (b) How does the answer to 
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(a) compare with the result of Example 25-1? Is the 
assumption in part (a) justified? (c) The effective mass 
of the electrons mi = 0.31 m. What is the electron 
scattering time 7,? 

(Answer: (a) 1.6 X 1072 m~3, (c) 7.6 x 107” sec.) 


25.16 The energy gap in silicon is 1.1 eV, whereas 
in diamond it is 6 eV. What conclusion can you draw 
about the transparency of the two materials to visible 
light (4000 A to 7000 A)? 


25.17 A cadmium sulfide (E, = 2.4 eV) photode- 
tector is illuminated with light of wavelength 3000 A. 
The intensity of the radiation falling on the detector 
is 30 W/m2. The area of the detector is 9 mm/7. (a) 
Show that electron-hole pairs will be generated by 
the light. (b) Assuming that each photon produces 
an electron-hole pair, how many pairs will be gen- 
erated every second? 


25.18 When a photon of energy E >> E, enters a 
semiconductor, it can produce several electron-hole 
pairs; that is, it can excite several electrons from the 
top of the valence band to the bottom of the con- 
duction band. A germanium crystal is used as a 
gamma ray (high energy photon) detector. (a) What 
is the maximum number of electron-hole pairs cre- 
ated by a 1.5 MeV gamma ray? (b) If the resolution 
of the detector is + 4 x 10° electron-hole pairs, what 
is the optimal energy resolution of the detector? 

(Answer: (a) 2.24 x 10°, (b) 2.7 x 10° eV.) 


25.19 When a semiconductor is doped with a do- 
nor type of impurity, the number of electrons in the 
conduction band is equal to the number of electrons 
excited from the valence band into the conduction 
band (the number of holes) plus the number of im- 
purity atoms that have been ionized. This leads to 
the equation (See Eq. 25.12) 


Ee = 
Noe | ar 
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where 


and 


Equation 25.12 is an equation for the Fermi energy 
E; (dependent variable) as a function of the temper- 
ature T (independent variable). The equation cannot 
be solved analytically. It can be solved by numerical 
methods with the aid of a computer. This is done as 
follows: We choose a value for T, and then we find 
the value of E; that makes both sides of the equation 
equal. Actually, because the problem is solved by 
numerical methods, it is practically impossible to 
find the value of E; that will make both sides of the 
equation exactly equal. We can approximate the 


problem by requiring that E; be such, that both 
sides differ from each other by a small amount, for 
example, 1%. 

Write a computer program that will yield E; for 
different T’s. Do this for T = 10 K — 2500 K. After 
finding E,, substitute it in the expressions for N, and 
N,, to obtain the free electron and hole densities. 

The following data and hints may be helpful: 


Np is the impurity concentration. Take it to be 10” 
atoms/m’, 
E, for silicon is 1.10 eV. 


Ep is the impurity energy level. Assume it is 
0.04 eV below the bottom of the conduction 
band, that is, E, — Ep = 0.04 eV. 


E; lies between 1.1 eV and 0.55 eV. 
m* = 0.31 m; mt = 0.38 m 
kg = 1.38 x 10°" J/K = 8.63 x 10°°eV/K 


CHAPTER 26 


Semiconductor 
Devices 
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26.1 INTRODUCTION 


In Chapter 25 we discussed the electrical properties of semiconductors and 
how these properties are affected by the introduction of different types of 
impurities in a pure silicon sample. We showed that the free electron con- 
centration is significantly increased by doping pure silicon with a small amount 
of phosphorous or other pentavalent element. We call these semiconductors 
n-type semiconductors. Similarly, the hole concentration is greatly enhanced 
when silicon is doped with trivalent impurities such as boron. This results in 
the formation of a p-type semiconductor. 

In this chapter we will consider how these two types of semiconductors 
are used to create the semiconductor devices—diodes and transistors—that 
are fundamental to the performance of such functions as rectification, am- 
plification, and switching in electronic circuits. These devices are the basic 
building blocks of computers and other digital instruments. 


26.2 METAL-METAL JUNCTION: 
THE CONTACT POTENTIAL 


Before we consider the operation of semiconductor devices, let us consider 
what happens when two dissimilar metals are joined together. This will bring 
out two important facts that we will use later. 

Let us recall a couple of facts that we discussed in connection with the 
photoelectric effect (Section 17.3). We saw that the kinetic energy with which 
the electrons were ejected from a metal was not uniform. The electrons came 
out with a continuous spread of energies. The explanation given was that 
electrons in the metal were bound with different amounts of energy. The 
smallest binding energy was called the work function ¢ of the metal. The 
other fact, which is pertinent to our present discussion, is that different metals 
have different work functions. For example, the photoelectric effect occurs 
with lower energy photons, that is, lower frequency, in potassium than it 
does in copper. 

We can explain these results rather simply in terms of the quantum 
mechanical model of electrons in a solid. The electrons occupying the highest 
energy levels in a solid (those most loosely bound) are the electrons in the 
conduction band. These electrons, as we have indicated in Chapter 24, behave 
according to the predictions of the quantum mechanical free electron model; 
that is, they occupy quasicontinuously all the energy levels available to them 
from the bottom of the band up to the Fermi level Ey. One reasonable mod- 
ification to the quantum mechanical free electron model is to change the 
infinite potential well to a finite one. An infinite potential well would imply 
that an infinite amount of energy is needed to liberate these electrons, which 
is experimentally not true. If we take as our zero energy the vacuum, that is, 
when the electrons are outside the metal, the energy diagram for the con- 
duction band can be represented as in Fig. 26-1. It is now clear that the 
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FIGURE 26-1 

Energy diagram for the electrons 
in the conduction band of a metal. 
The zero energy level is chosen 
to be the energy outside the metal 
(vacuum level). The electrons at 
E; need the least amount of en- 
ergy to come out of the metal. 
This minimum energy was first 
introduced in Chapter 17 as the 
work function ¢ of the metal. 
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meaning of the work function is simply the energy difference between the 
vacuum level and the Fermi level. The electrons at E; require the least photon 
energy to be ejected to the outside. An electron below the Fermi level requires 
a photon of correspondingly higher energy to be ejected. If this higher energy 
photon is absorbed by an electron at the Fermi level, however, it will have 
more energy than @, the required energy to leave the metal. This extra energy 
is the kinetic energy of the ejected electron. Because kinetic energy is a func- 
tion of the velocity, when photons with energy hv > ¢ are used, the emitted 
electrons will have a distribution of velocities. The fact that the work function 
is not the same for all metals implies that the position of the Fermi level 
relative to the vacuum level is different for different metals. 

Let us now consider two metals, 1 and 2, with work functions ¢; and 
gz, respectively, and let ¢; < 2. When the two metals are far apart, their 
conduction bands will look as in Fig. 26-2. If these metals are placed in contact, 
electrons are free to flow from one to the other. Because the electrons near 
the Fermi level of metal 1 have higher energy than those in metal 2, there 
will be a net flow of electrons from metal 1 to metal 2. This flow will continue 
until the highest occupied energy level is the same in both metals, that is, 
until both metals have a common Fermi level. Figure 26-3a illustrates that the 
Fermi levels of the two metals in contact are both changed from their values 
in isolation to a common level. This is analogous to the situation where two 
containers filled with water up to different levels are interconnected. Water 
will flow from the container where the level is higher (and therefore the 
potential energy of the water is higher) into the other container, until the 
water level is the same in both containers. 
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FIGURE 26-2 

Energy diagram for the electrons 
in the conduction band of two 
dissimilar isolated metals with 
different work functions. Note 
that E;-,; is closer to the vacuum 
level than is E;, and therefore 
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FIGURE 26-3 

(a) The two metals of Fig. 26-2 
are brought in contact. Electrons 
near the top of the Fermi level of 
metal 1 will move into the lower 
energy levels above the Fermi 
level of metal 2 until the maxi- 
mum energy of the electrons in 
both metals is the same, that is, 
until the Fermi level is the same 
in both metals. (b) The flow of 
electrons from metal 1 into metal 
2 leaves the surface of the latter 
negatively charged whereas that 
of metal 1 remains positively 
charged. This results in a poten- 
tial difference V. between the two 
metals. 
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As a result of the flow of electrons, metal 2 will gain some electrons and 
become negatively charged whereas metal 1 will become correspondingly 
positively charged because of the loss of electrons. The net result is that an 
electrical potential difference will be established across the junction. Metal 1, which 
is positively charged, will be at a higher potential than metal 2. This potential 
difference is called the contact potential V,. Figure 26-3b shows that there has 
been a shift in voltage between the two metals in contact as a result of the 
shift in electron concentration from one to the other. 

There is an important principle here: The Fermi levels of two conducting 
solids in contact must be equal. Although we have shown this for two metals, 
the conclusion holds for semiconductors and is the result of more general 
thermodynamic arguments. It should be noted, however, that this is an equi- 
librium condition that can be varied by the application of an external voltage. 
As will be shown later, such controlled variation is basic to the operation of 
the diode and the transistor. 


26.3 THE SEMICONDUCTOR DIODE 
26.3a Contact Potential: Band Scheme of a p-n Junction 


The simplest semiconductor device is the p-n junction, or semiconductor diode, 
a device that permits current flow in only one direction. It consists of a 
p-type semiconductor placed in contact with an n-type semiconductor (see 
Fig. 26-4a). The electronic symbol of the diode is shown in Fig. 26-4b. Although 
in practice the junction is not formed by physically putting side by side a 
piece of n-type semiconductor and a piece of p-type, we will assume this for 
the present time. Some of the methods used to make p-n junctions are dis- 
cussed in Chapter 28. 

Before contact is made, the n region has a large number of free electrons 
(electrons in the conduction band) provided by the donor impurities. The 
charge of these mobile electrons in the n-type semiconductor is neutralized 
by the space charge of the positive donor ions, thus guaranteeing charge 
neutrality. Similarly, in the p type semiconductor, there is a large number of 
mobile holes whose charge is neutralized by the space charge of the negative 
acceptor ions. When the two types of semiconductors come in contact there 
will be a net diffusion of electrons from the n region into the p region. This 
is analogous to what happens when two gas containers are interconnected. 
If the density of molecules (and therefore the pressure) is not the same in 
both containers, there will be a net diffusion of molecules from the high 
density container to the low density one. Having crossed the junction, these 
electrons find themselves in a region where there are lots of holes with which 
they can and do recombine. While this is going on, the holes in the p region 
are diffusing across the junction into the 1 region; once there, they recombine 
with the electrons of the conduction band of that region. The net effect of 
this double diffusion and recombination is to deplete the region near the 
junction, on both sides, of its majority mobile carriers, thus making it highly 
resistive. At the same time, the immobile donor ions on the n side and the 
acceptor ions on the p side are left charged, thus creating charged layers on 


FIGURE 26-4 

(a) Physical composition of a 
semiconductor diode. (b) Elec- 
tronic symbol of a semiconduc- 
tor diode. 
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both sides of the junction, a positive layer in the n-type region and a negative 
layer in the p-type region. The situation is illustrated in Fig. 26-5. The diffusion 
of majority carriers across the junction in each direction ends quickly. The 
reason is that the charge layers at the junction create an internal electric field 
€ that prevents further diffusion, see Fig. 26-5. Thus, equilibrium is estab- 
lished and no net current across the junction takes place. 

This picture of how equilibrium is established is a kinetic one. The sit- 
uation can also be analyzed from an energy point of view. The charge layers 
create a potential difference V. between both sides of the junction, the n side 
being positively charged will be at a higher potential (voltage) than the p side, 
which is negatively charged. We may expect that the potential will vary 
monotonically in the depletion region (see Fig. 26-62). This potential difference 
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FIGURE 26-5 

Formation of a depletion layer (re- 
gion without mobile charge car- 
riers) at the p-n junction resulting 
from the diffusion of electrons 
from the n region into the p re- 
gion and diffusion of holes from 
the p region into the n region. 


FIGURE 26-6 

(a) Potential difference V, result- 
ing from the positive donor ions 
in the n side of the depletion layer 
and the negative acceptor ions in 
the p side of the depletion layer. 
(b) Potential energy barrier faced 
by the majority charge carriers 
(electrons) in the n side of the 
diode as they attempt to cross 
the junction. (c) Potential energy 
barrier faced by the majority 
charge carriers (holes) in the p 
side of the diode as they attempt 
to cross the junction. 
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is analogous to the contact potential that appeared in the metal-metal junction 
discussed in the previous section. 

We know that when a charge q is placed at a point of potential V, the 
potential energy of the charge E, = qV (Eq. 14.16). Therefore, when an elec- 
tron is in the n-region its potential energy E, = —|e|V., whereas in the 
p-region the potential energy FE, = 0, that is, the potential energy of the 
electrons in the p side of the junction is shifted upward relative to the n side 
by an amount |e|V,; this is illustrated in Fig. 26-6b. For the holes, the opposite 
is true. In the n side, the potential energy of the holes is E, = +|e|V., whereas 
in the p side E, = 0, see Fig. 26-6c. These potential energy barriers stop the 
further flow of majority carriers across the junction. Thus, one of the con- 
sequences of the contact potential is the upward shift of the electronic energy 
bands in the p side, relative to the bands in the n side, by an amount |e|V,. 
This shift is associated with the requirement that the Fermi level be the same 
on both sides of the junction. The shifting of the bands is illustrated in Fig. 
26-7. The fact that E- must be the same on both sides determines the value 
of the contact potential. Because the position of the Fermi level in each side 
relative to its band structure (for example, relative to the top of its conduction 
band) does not change on contact, the energy shift in the bands |e|V. must 
be equal to the difference that existed between the two Fermi levels before 
contact, that is, 


le|\V. = Ern — Erp (26.1) 
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FIGURE 26-7 

(a) Electronic energy bands of the 
n-type and the p-type semicon- 
ductors before contact is made. 
(b) Upward shifting of the elec- 
tronic energy bands in the p side 
relative to the n side after the 
junction is formed. The shifting 
is due to the increase in the po- 
tential energy of the electrons in 
the p region relative to the n re- 
gion (see Fig. 26-60). 


where Er, and E;, are the Fermi levels in the n side and in the p side, re- 
spectively. In a doped semiconductor the position of the Fermi level depends 
on the impurity concentration and on the temperature (see Section 25-3b); 
the contact potential V, will be determined by these two parameters. Typically, 
for a silicon diode V, has values that range between 0.6 and 0.9 V at ambient 
temperatures. 


26-3b Equilibrium Currents Across the p-n Junction 


In the preceding section we saw that when the p-n junction is formed a 
potential energy barrier |e|V, is formed that stops the flow of majority carriers 
across the junction. This is a dynamic equilibrium. Electrons and holes con- 
tinuously flow across the junction; the net flow is zero because equal amounts 
flow in opposite directions. Let us consider only the flow of electrons as an 
illustration. There are electrons in the conduction band of both the n-type 
and the p-type semiconductors. In Chapter 25, Section 25-2a, we showed that 
the number of electrons N, in the conduction band is (Eq. 25.3), 


E, — Er 
“ar ) 

Although Eq. 25.3 gives the free electron concentration for both types of 
semiconductors, the numerical value of N,, at ambient temperatures, is several 
orders of magnitude smaller for the p-type than for the n-type. The reason 
is that E, — E,; is much greater for the p-type than for the n-type (see Fig. 
26-8). 

The electrons (minority carriers) in the conduction band of the p region 
are not impeded by the potential energy barrier from crossing the junction 
from the p side to the n side. The electron current from p to n, i(p > n), will 
be proportional to the total number of electrons in the p region. We may write 
the proportionality of this current to N, as an equality by introducing A as 


N. = N, exp ¢ (25.3) 
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FIGURE 26-8 

After the junction is formed, an 
equilibrium is established where 
there is no net flow of electrons 
(or holes) across the junction. 
Electrons do flow across the 
junction. The relatively few (mi- 
nority carriers) electrons in the p 
side are not impeded by the en- 
ergy barrier from crossing the 
junction. This flow is compen- 
sated by the flow (in the opposite 
direction) of those electrons in the 
n side with energies E > |e|V.. 
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the proportionality constant: 


i(p—>n) = Aexp [ ar) (26.2) 


where E, = E, — Ey of the p side (see Fig. 26-8). 

In the n side there are a large number of electrons (majority carriers) in 
the conduction band. However, only those having energy equal to or greater 
than the barrier energy |e|V, will be able to cross the junction from the n side 
to the p side. The electron current, i(n — p), associated with this flow will be 
proportional to the number of electrons in the n region with energies greater 
than or equal to |e|V,. 


i(n—> p) = AN. f(E 2 lelV.) (26.3) 


where N, is the total number of electrons in the conduction band of the n 
side and f(E = |e|V.) is the fraction of these electrons with energies greater 
than or equal to |e|V.. In Chapter 25, Eq. 25.2, we indicated that for the 
electrons in the conduction band of silicon, at ambient temperatures, the 
Fermi-Dirac distribution can be approximated by the Maxwell-Boltzmann dis- 
tribution. Moreover, in Supplement 9-1 of Chapter 9, we showed that for a 
system of particles having the Maxwell-Boltzmann distribution of energies, 
the fraction of particles with energies greater than or equal to a given value 
EF; is given by the Boltzmann factor, exp (— E,/kgT), Eq. 9.41. Therefore, in 
the present case, the fraction of electrons having energies greater than the 
barrier |e|V, is 


V. 
f(E = |elV.) = exp (- re) (26.4) 


Substitution of Eq. 25.3 for N, and Eq. 26.4 for f(E = |e|V.) in Eq. 26.3, yields 


FE, + |elV, 
i(n— p) = A exp (~ == EM) 


(26.5) 
where E, = E, — E; in the n region (see Fig. 26-8). From Fig. 26-8, it is seen 
that E, + |e|V. = E,, and it follows that 
i(n > p) = (pn) 

The flow of electrons from n to p is equal to the flow from p to n. Therefore, 
the net current is zero. Although the preceding calculations have not yielded 
any startling results, we can draw an important conclusion from them. The 
electron current from p to n (that is, associated with minority carriers) is not 
affected by the height of the potential energy barrier because once an electron 
is in the conduction band there is no barrier in the p to n direction. On the 
other hand, the electron current from n to p (associated with majority carriers) 
depends by a negative exponential on the height of the barrier, Eq. 26.5. It 
is this property that makes the p-n junction a rectifier; that is, current flows 
easily in one direction but not in the other. This is considered next. 


26-3c Voltage-Current 
Characteristics of a Diode: Rectification 


Let us now consider what happens to the potential energy barrier at the p-n 
junction when an external voltage Vp is applied across the diode. The two 
possible ways this voltage can be applied are illustrated in Figs. 26-9a and 26- 
9b. In the first case (Fig. 26-9a) the diode is said to be forward biased, and in 
the second case (Fig. 26-9b) it is said to be reverse biased. We have seen in Fig. 
26-5 that the contact region between the 7 and the p sides is depleted of its 
majority carriers. This depletion region therefore has a high resistance and, 
as a consequence, the potential difference Vo will appear almost entirely across 
it (see Eq. 15.15). In the forward biased case (Fig. 26-9a) the battery lowers 
the potential of the n side of the junction relative to the p side by Vo. Because 
initially (before Vo was applied) the potential of the n side was higher than 
the p side by V, (see Fig. 26-62), the difference in potential between the n side 
and the p side is now reduced to V. — Vo. This is shown in Fig. 26-10a, where 
the solid line represents the new potential difference and the dashed line 
represents the potential difference before Vo is applied. 

Concomitant with the reduction in the potential difference between both 
sides of the junction, there is a lowering of the potential energy barrier en- 
countered by the majority carriers as they try to cross the junction. In the n 
region, the potential energy of the electrons is now E, = —|e\((V. — Vo), 
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(b) 
FIGURE 26-9 
Two possible ways of connecting 
a diode to an external source of 
potential difference Vo. (a) For- 
ward biased diode. (b) Reverse 
biased diode. 


FIGURE 26-10 

Forward biased diode. (a) Poten- 
tial difference between the 1 and 
the p sides of the junction in the 
forward biased case of Fig. 26- 
9a. The battery has reduced the 
potential difference from the 
value V, (dashed line) in the un- 
biased case (see Fig. 26-6a) to 
V. — Vo (solid line). (b) New (solid 
line) and old (dashed line) po- 
tential energy barrier for the elec- 
trons in the n side of the junc- 
tion. (c) New (solid line) and old 
(dashed line) potential energy 
barrier for the holes in the p side 
of the junction. 
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whereas in the p region it is E, = 0. The new energy barrier |e|(V. — Vo) 
encountered by the electrons as they attempt to cross from the n side to the 
p side is represented schematically by the solid lines of Fig. 26-10b. The dashed 
lines in Fig. 26-10b represent the old energy barrier |e|V.. Similar remarks 
apply for the holes. In the 1 region, their potential energy E, = |e|(V. — Vo) 
whereas in the p region E, = 0. The new (solid lines) and the old (dashed 
lines) energy barriers encountered by the holes as they attempt to cross the 
junction from the p side to the n side are shown in Fig. 26-10c. The lowering 
of the energy barrier will greatly increase the flow of majority carriers crossing 
the junction, thus giving rise to a large current through the diode. We will 
return to this shortly. First, however, let us consider what happens when the 
diode is reverse biased (Fig. 26-9b). In this case, the battery will raise the 
potential of the 1 side relative to the p side by Vo. Because the potential 
of the n side was already higher than that of the p side by V., the poten- 
tial difference between the two sides of the junction will now increase to 
V. + Vo. As a result, the potential energy barrier encountered by the majority 
carriers will also increase from |e|V, (unbiased condition) to |e|(V. + Vo) (reverse 
biased condition). Figure 26-1la shows the new (solid lines) and the old 
(dashed lines) potential differences between the two sides of the junction. 
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FIGURE 26-11 

Reverse biased diode. (a) Poten- 
tial difference between the n and 
the p sides of the junction in the 
reverse biased case shown in Fig. 
26-9b. The battery increases the 
potential difference from V. 
(dashed line) in the unbiased case 
to V. + Vj (solid line). (b) and (c) 
The potential energy barriers 
faced by electrons in the n side 
of the junction and by the holes 
in the p side increase from |e|V. 
to |el(V. + Vo). 


The associated potential energy barriers for electrons and holes are shown in 
Figs. 26-11b and 26-11c, respectively. Again the solid lines represent the energy 
barriers in the reverse biased condition and the dashed lines in the unbiased 
condition. In sum, we have shown that the height of the energy. barrier is 
lel(V. — Vo) when the diode is forward biased and |e|(V. + Vo) when it is 
reverse biased. If we adopt the sign convention that Vo ts positive in the forward 
biased case and negative in the reverse biased case, we can make the following 
statement that applies to both bias conditions: When an external voltage Vo is 
applied across the diode, the height of the potential energy barrier at the p-n junction 
is |e|(V. — Vo). 

_ Let us now consider the crucial question: What is the net flow of charge 
carriers across the junction under forward and reverse bias conditions? Again 
for the sake of simplicity, we will limit our discussion to the flow of electrons. 
The net electron current from the n side to the p side will be 


i= in— p) — (p—n) (26.6) 


The first term i(n — p), which represents the flow of electrons (majority 
carriers) from n to p, is proportional to the number of electrons in the con- 
duction band of the 1 region having energies greater than the height of the 
energy barrier, which now is |e|(V. — Vo). This number, as we saw earlier, 
Eq. 26.3, is equal to the total electron density N, multiplied by the Boltzmann 
factor exp(— E,/kgT), where in this case E; is the barrier energy |e|(V. — Vo); 
that is, 


E, + |e\((V. — ) (26.7) 


in — p) = Aexp (- kT 
where E, = E, — Er in the n side (see Fig. 26-8). The second term of Eq. 26.6, 
i(p —> n), which represents the flow of electrons (minority carriers) from p to 
n, is the same as in the unbiased condition, because there is no barrier for 


the electrons in the p to n direction. Thus i(p — n) is still given by Eq. 26.2. 
Substituting Eq. 26.7 for i(n — p) and Eq. 26.2 for i(p — n) in Eq. 26.6, we 


obtain 
a E, ss lel(V.. = Vo) E, 
1 = Aexp ( rn A exp aT 


Recalling that E, + |e|V. = E, (see Fig. 26-8), we write 
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we obtain 


(26.8) 


| = lo 


exp ( 4 — = 


We note that ip is the current associated with the flow of minority carriers 
(electrons) from the p side to the n side (Fig. 26-8) because EF, is unchanged 
by the applied voltage. It is called the reverse saturation current. In a silicon 
diode at room temperature, the magnitude of ip is typically ~ 0.1 wA. This 
is a very small value, and the reason is that the supply of electrons in the 
conduction band of a p-type semiconductor is very small because E; is large 
compared with the room temperature thermal energy kgT. 

Equation 26.8 is called the diode equation. Let us see what it tells us. Recall 
that in the derivation of Eq. 26.8, we used the convention that Vo is taken as 
positive in the forward biased case and negative in the reverse biased case. 
Because Vo is negative when the diode is reverse biased, the exponential term 
of the equation becomes negligible for relatively small voltages, and a constant 
small electron current i ~ —ig remains. (The negative sign means that the net 
electron flow is from p side to n side.) We do not need a very large voltage 
to reach this situation. At room temperature, kgT = 0.025 eV. If we take 
Vo = 0.1 V, theni = ie~* — 1) = — 0.98 ig. On the other hand, if the diode 
is forward biased, the current increases very rapidly with increasing voltage. 
For example, if 


Vo = 0.1 V, 7 = ip(e* — 1) = 541g 
whereas if 
Vo =0.5V, i = igfe?? — 1) = 4.8 x 108 ig 


In practice, the current does not rise quite so rapidly because of the resistance 
of the body of the semiconductor material as well as other resistances present 
in the circuit. In most circuits used in computers and other digital instruments, 
the current is limited by resistors in the circuit to a few milliamps. As a result, 
the voltage across a forward biased diode is almost never greater than a few tenths of 
a volt; 0.6 V or less is a fairly typical value for a silicon diode. The behavior 
described by Eq. 26.8 is illustrated in Fig. 26-12. 

The discussion of the direction of the electron current referred only to 
the direction of motion of the electrons. As indicated in Chapter 15, Section 
15.3, the direction of the conventional current is that of the hypothetical 
motion of positive charges, which is, of course, opposite to the direction of 
motion of the electrons. This conventional concept, that in a circuit the current 
is from the positive terminal of the battery to the negative one, is a holdover 
from the early days of electricity before the discovery of the electron, and it 
is still in use. Thus, in the forward bias case, the net electron flow is from 
the 7 region to the p region, whereas the conventional current would be from 
the p to the n (see Fig. 26-92). 

Holes have not been discussed for the sake of simplicity. It is not difficult 
to go through arguments for the holes similar to those for electrons and to 
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FIGURE 26-12 
Current-voltage characteristics for 


a biased diode. Positive values of 


Vo correspond to the forward 
biased case and negative values 
of V, to the reverse biased one. 
The value of ip in the figure is 
highly exagerated compared with 
the value of the current in the 
forward biased condition. 
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show that their behavior, under forward and reverse bias, is identical to that 
of the electrons. Of course, the flow of holes is in the opposite direction to 
that of the electrons, and so is the potential barrier. The expression for the 
hole current is the same as that obtained for the electrons. 

Two final comments should be made about the width of the depletion 
layer. 


1. Thé width of the depletion layer increases as the potential difference across 
the junction increases. In Section 26-3a, we saw that a contact potential V, 
is created by the ions on both sides of the junction when the majority 
carriers diffuse across the junction. Clearly, the greater the number of 
charged ions (and therefore, the wider the depletion layer), the greater the 
potential difference across the junction will be. Conversely, if the potential 
difference across the junction is increased, such as by the application of a 
reverse bias voltage Vp (see Fig. 26-11a), the number of charged ions on 
both sides of the junction will also increase. 


2. For a given junction potential, the depletion layer is wider for the semi- 
conductor that is the least doped. The reason for this is that a greater depth 
has to be depleted to match the number of carriers that can be obtained 
from a small depth of a highly doped semiconductor. These two facts about 
the width of the depletion layer will be important in understanding the 
behavior of field effect transistors (FET), which will be discussed later. 


26.4 THE BIPOLAR JUNCTION TRANSISTOR (BJT) 


When a layer of n-type silicon is sandwiched between two layers of p-type 
silicon, we obtain a pnp transistor. If a layer of p-type is sandwiched between 
two layers of n-type, we have an npn transistor. The filling of the sandwich 
is called the base (B) and the two ends are called the emitter (E) and the collector 
(C). The two types of transistors and their electronic symbols are shown in Fig. 
26-13. One very important fact is that in both cases the base is very narrow 
(~ 10~° m) and very lightly doped compared with the emitter. 

An npn transistor can be considered as an np diode followed by a pn 
diode, and a similar analogy applies to the pnp transistor. Consequently, the 
ideas presented in the previous discussion of the diode will be used to explain 
the physical behavior of the transistor. We will limit our discussion to the 
npn transistor. Arguments similar to ones to be presented for the npn transistor 
can be made for the pnp transistor. 

When the three semiconductor layers are put together, majority carriers 
will flow across the two junctions, creating contact potential differences be- 
tween the emitter and the base and between the collector and the base. If we 
assume completely symmetrical junctions (we assume that the emitter and 
the collector have identical doping concentrations), the contact potential V, 
at the emitter-base junction and at the collector-base junction will be equal. 
Recall that in the absence of an external bias voltage, the potential of the n 


(a) 

FIGURE 26-13 
(a2) Physical composition and 
electronic symbol of an npn tran- 
sistor. (b) Physical composition 
and electronic symbol of a pnp 
transistor. 
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side of a pn junction is higher than that of the p side by the contact potential 
V. (Section 26-3a). The variations in potential in the three sections of the npn 
transistor are shown in Fig. 26-14a. The associated potential energy barriers 
for electrons and holes are illustrated in Figs. 26-14b and 26-14c, respectively. 


26-4a Common Base Configuration 


To appreciate the basic features of a transistor as an active circuit element, 
let us consider the circuit shown in Fig. 26-15. The left side of the circuit, that 
is, the section containing V, the base, and the emitter, is identical to the 
forward biased diode circuit of Fig. 26-9a. On the other hand, the right side, 
the section containing V2, the collector, and the base, is identical to the reverse 
biased diode circuit of Fig. 26-9b. Because the base is common to both sections, 
Fig. 26-15 is referred to as a common base configuration circuit. For the reason 
given earlier (high resistance of the depletion layer at the junctions), the 
externally applied voltages will appear almost entirely across the junctions. 
The battery V;, on the left-hand circuit, will raise the potential of the p type 
base relative to the n type emitter by an amount that we will call Vgz. As a 
result, the potential difference between the emitter and base is reduced to 
V. — Vz. On the right-hand circuit, the opposite happens. The battery V> 
lowers the potential of the base relative to the collector by an amount that we 


FIGURE 26-14 

(a) Potential differences between 
the three parts (emitter, base, 
collector) of the transistor asso- 
ciated with the depletion layers 
at the emitter-base junction and 
at the base-collector junction. (b) 
and (c) Potential energy barriers 
for electrons and holes, respec- 
tively, associated with the poten- 
tial differences of (a). 
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FIGURE 26-15 
Common _ base_ configuration 
transistor circuit. 
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will call Vc; as a result the potential difference between the collector and the 
base is increased to V. + Vcg. Just as in the case of the diode, this leads to a 
reduction in the potential energy barrier at the emitter-base junction and an 
increase in the barrier height at the collector-base junction. The new potential 
differences between the junctions are sketched in Fig. 26-16a. The associated 
potential energy barriers for electrons and holes are shown in Figs. 26-16b 
and 26-16c, respectively. The dashed lines in the three figures represent the 
values of these quantities before the application of the biasing voltages. 

The lowering of the barrier at the emitter base junction will permit elec- 
trons to be injected from the emitter into the base, as well as holes from the 
base into the emitter. The electrons that neither recombine with holes in the 
base region nor flow through the contact lead on the base will diffuse across 
the base and reach the collector junction (see Fig. 26-17). Once there, they 
see the positive potential difference V. + Vcp (negative potential energy), 
they are accelerated across the junction, and are “‘collected’’ by the collector. 

Let us analyze the various components contributing to the emitter current 
iz, the collector current ic, and the base current ig shown in Fig. 26-15. First, 
however, we should note that any charge carrier crossing the emitter-base 
junction will contribute to the emitter current. For example, if an electron 
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(a) Reduction in the potential dif- 
ference between the emitter and 
the base by the battery V, of Fig. 
26-15 and increase in the poten- 
tial difference between the col- 
lector and the base by the battery 
V. of the same figure. The dashed 
lines represent the potential dif- 
ferences in the unconnected 
transistor. (b) and (c) Potential 
energy barriers for the electrons 
and holes, respectively, associ- 
ated with the new potential dif- 
ferences of (a). The dashed lines 
are the energy barriers in the un- 
connected transistor. 
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FIGURE 26-17 
Schematic representation of the 
flow of electrons and holes across 
the energy barriers at the two 
junctions of the transistor for the 
common base configuration cir- 
cuit of Fig. 26-15. 
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from the emitter crosses this junction into the base, an electron from the wire 
to the left of the emitter will enter the emitter region to preserve the charge 
neutrality of this region. When a hole crosses the junction from the base to 
the emitter, it will recombine with an electron in the emitter. As a result, the 
emitter region will have a net positive charge + |e| that will be neutralized by 
the entrance of an electron from the wire to the left of the emitter. For similar 
reasons, any charge carrier crossing the collector-base junction will contribute 
to the collector current ic. The difference between the carriers crossing the 
two junctions accounts for the base current ig. 

The emitter current ig consists of an electron current i; (majority electrons 
crossing from the emitter into the base) and a hole current i,. (majority holes 
crossing from the base into the emitter), 


lp = leg t+ lpg (26.9) 


In a commercial transistor, the doping of the emitter is much greater than 
that of the base. Because the current across the junction depends not only 
on the height of the barrier but also on the number of majority carriers in 
each region, this implies that i,, << i.2. Eq. 26.9 can be approximated as 


ig ~ ing (26.10) 


Let us now look at the collector current ic. What are the components of 
ic? The main contribution comes from the electrons emitted into the base that 
are able to diffuse across the base without recombining with holes. We can 
designate this contribution as a ig, where a is the fraction of the electrons 
that are able to diffuse across the base. How large is a? Its value depends on 
two things: (1) the so-called minority carrier lifetime 7, (the time that an 
electron can survive without recombining in a hole-rich region), and (2) the 
time that the electron remains in the base as it diffuses toward the collector 
junction, Tp. A typical value for 7, is 10~* sec and can be made greater by 
reducing the doping of the base. By making the base region very thin, the 
diffusion time tp can be made much smaller than the lifetime 7,. For a base 
0.1 mm wide, the diffusion time turns out to be roughly 10~° sec. Thus, 
practically all the electrons will be able to get to the collector. Typical values 
for a range from 0.900 to 0.998. 

Because the base-collector junction is reverse biased (see Fig. 26-16), there 
is no contribution from the majority carriers (holes in the base and electrons 
in the collector) to ic. The only additional contribution is that from the small 
reverse saturation current ig due to the flow of minority carriers across the 
collector-base junction. This contribution, as shown in Section 26.3c, is ex- 
tremely small. Thus, 


Ic = Alp oe lo 
~ air (26.11) 


The important conclusion is that the collector current is essentially determined by 
the emitter current, which is determined by the potential difference between 


In 1956 John Bardeen, William 
Shockley and Walter Brattain 
shared the Nobel Prize in Phys- 
ics for their discovery of the tran- 
sistor. 
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the base and the emitter Vpg. It is essentially independent of the potential 
difference between the collector and the base Vcg. Actually, the previous 
statement is not entirely correct because a can vary by 0.1% as Vcp changes 
from 0 V to 10 V. The reason is that the width of the depletion layer at the 
junction depends on the potential difference across the junction, as indicated 
in Section 26.3c. Thus, as the reverse bias voltage at the collector-base junction 
is increased, the depletion layer (region depleted of its majority carriers) 
penetrates deeper into both the collector and the base. This means that the 
“effective base” (the region where free holes exist) decreases. When this 
happens, the chances of recombination within the base decrease and thus a 
increases. 


26-4b The Transistor as a Voltage Amplifier 


Figure 26-18 illustrates a modified version of the common base configuration 
circuit that we have been considering, Fig. 26-15. A small resistor R; has been 
introduced in the emitter-base circuit and a larger resistor R2 in the collector- 
base circuit. The reason for the introduction of these two resistors and for 
choosing R2 greater than R, will become clear soon. 

We should note that although the introduction of these two resistors will 
reduce the potential differences across the junctions (recall that when a current 
i flows through a resistor R, there is a potential drop iR across the resistor), 
it does not affect the type of bias of the junctions. The emitter-base junction 
remains forward biased because the p-type base is connected directly to the 
positive side of V; whereas the n-type emitter is connected through the resistor 
R, to the negative side of V;. Similarly, the collector-base junction remains 
reverse biased because the p-type base is connected directly to the negative 
side of V2 whereas the n-type collector is connected through the resistor R» 
to the positive terminal of V2. We should also point out that the reduction of 
Vcr resulting from the presence of a large R2 will not significantly affect the 
value of ic because, as we have shown in the preceding section, ic is essentially 
determined by ig, not Ves. 

One of the main uses of the transistor is as an amplifier. In the common 
base configuration shown in Fig. 26-18, the input, as a voltage difference, is 
fed into the emitter-base circuit by applying a voltage AV, as shown in Fig. 
26-18. 


Output AV» 


FIGURE 26-18 

Modified version of the common 
base configuration circuit of Fig. 
26-15. A small resistor R,; and a 
large resistor R, have been intro- 
duced for voltage amplification 
purposes. 
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The output is taken out of the collector-base circuit as a voltage drop 
across resistor Ry. The circuit in Fig. 26-18 is an amplifier because the voltage 
is amplified by a large factor. The emitter current jg is limited by R,, which is 
usually chosen to be small (for example, 100 ©), and the effective resistance 
R, of the forward biased emitter-base junction. This current reappears (ic = 
Qiz) in the output circuit and flows through a large load resistor R» (for ex- 
ample, 50 kQ), called output resistance, and causes a large voltage drop across 
it. Let us assume that a small voltage AV; is applied to the input terminals; 
this will lead to a change in the emitter current i, so that 


AV, 
ae 
and to a corresponding change in ic 
AV 
Aic = aAip = 0 a rR (26.12) 
The output voltage across the load resistor R2 will increase by 
AV> = R, Alc 
Substituting Eq. 26.12 for Aic 
| | R, 
= @———— AV 
AV> a R, 4 R. 1 
and, because a ~ 1, we obtain | 
Rp 
eae NV. 26.1 
ay Tear Te (26.13) 


The input voltage will be amplified at the output by a factor equal to the ratio 
of the output to the input resistances. 

The effective resistance R, of the forward biased emitter-base junction is 
not constant but depends on the voltage across the junction. This is because 
in a p-n junction the current does not increase linearly with voltage; that is, 
a p-n junction does not obey Ohm’s law (see Fig. 26-12). As we saw in Section 
26.3c, for a forward biased voltage greater than a few tenths of a volt, the 
current increases very rapidly as the voltage across the junction increases; 
this means that R, is very small, typically a few ohms. If we take R, = 100, 
R; = 100 Q, and R, = 50 kQ, 


5 x 10? 0 
= —__—_—_ Ay 
ave 1002+100 ° 
AV> = 455 AV, 


The input voltage is amplified by a factor of 455. 
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26.4c Common Emitter Configuration 


The most common configuration, particularly in switching circuits, is the one 
in which the emitter is common to both the input and the output circuits. 
Figure 26-19 is an example of a common emitter configuration circuit. The 
input branch, consisting of V;, Ri, and the base-emitter junction is identical 
to the input section of the common base configuration circuit of Fig. 26-18. 
The p-type base is connected directly to the positive terminal of V; whereas 
the n-type emitter is connected through the resistor R; to the negative terminal 
of V,. The emitter-base junction is therefore forward biased. The output branch 
is different from any of the circuits that we have considered thus far. In 
addition to the battery V2 and the load resistor R2, it includes two junctions, 
the collector-base junction and the emitter-base junction. Because the emitter 
is connected directly to the negative terminal of V2, whereas the collector is 
connected through R, to the positive terminal of V2, the potential of the 
collector is higher than that of the emitter by an amount Vcg = V2 — icRo. 
The nature of the bias at the collector-base junction will depend on the relative 
values of the voltage between the collector and the emitter Vcg and the voltage 
between the base and the emitter Vpg. This will be considered in more detail 
below. 

In the common emitter configuration, the input current ig is the inde- 
pendent variable and the output current ic is the dependent variable. Putting 
it differently, the base current ig, which is determined by V; and R; because 
the emitter-base junction is forward biased, controls the collector current ic. 
Figure 26-20 shows the characteristic curves for a standard 2N2222A npn 
transistor. The horizontal axis is the collector-emitter voltage Vcr, and the 
vertical axis is the collector current ic. The curves correspond to different 
values of ig. 

The output characteristics can be divided into two regions: the active region 
and the saturation region. The active region is to the right of the knee of the 
curves; at the knee Vcg ~ a few tenths of a volt. In this region ic is very 
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FIGURE 26-19 
Common emitter configuration 
transistor circuit. 
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sensitive to ig but almost independent of Vcr. The saturation region is to the 
left of the knee. In the saturation region ic increases rapidly with Vcg. In the 
active region, the collector-base junction is reverse biased. Let us see how 
this comes about. The emitter-base junction is forward biased. As indicated 
in Section 26.3c, the voltage across a forward biased junction is usually limited 
to a few tenths of a volt, 0.6 V or less is a fairly typical value for a silicon 
junction. Therefore, Vgp will be 0.6 V or less. If Vcg is greater than 0.6 V, 
then the collector is more positive than the base relative to the emitter, and 
this in turn means that the collector is at a higher potential than the base: 
Consequently, the collector-base junction will be reverse biased. Thus, in the 
active region the emitter-base junction is forward biased whereas the collector- 
base junction is reverse biased, that is, the situation is analogous to the 
previously discussed common base configuration. We can explain the vari- 
ation of ic by making use of our previous result, Eq. 26.11, ic = a ig, and 
Kirchhoff’s current rule, 


Iz = Ip + Ic (26.14) 


Substituting for iz = ic/a in Eq. 26.14, 


1 
+= ig + ic 
a 
_ fil 
Ic (; oo 1) = Ip 
a 
- ff Le lc = Bip 
| _ 
Ic a e where 
; Y 
lc = Bp lp B= 


where B = a/(1 — a) is called the current gain parameter. We mentioned before 
(Section 26.4a) that a varies slightly with Vcg (and in this case with Vcr). The 
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change in a as Vcg changed by 10 V was ~ 0.1%. The’ variations in B can be 
significantly greater. For example, if we take a = 0.990, then B = 99; ifa = 
0.991, B = 110; a 0.1% change in a causes an 11% change in B. Other than 
this, the magnitude of ic is essentially determined by ig, and ic is very sensitive 
to ig. For example, in the case when a = 0.99, we saw that B = 99; therefore, 
a change of 10 pA in ig will give rise to a change of 990 wA in ic. Thus, when 
the common emitter circuit 1s used in the active region, it acts as a current amplifier. 

Let us now try to understand the rapid decrease in ic when Vcg drops 
below a few tenths of a volt (see Fig. 26-20). In this region, Vcg < Vp; that 
is, the base is more positive than the collector relative to the emitter, and 
therefore the base is at a higher potential than the collector. As a consequence, 
the collector-base junction will become forward biased. In our previous dis- 
cussion of the common base configuration, we mentioned that the only sig- 
nificant contribution to ic was due to that fraction a of the emitter-injected 
electrons that diffused across the base (that is, ic = aig). This is fine if the 
collector-base junction is reverse biased. In this case the only additional con- 
tribution to ic is the small reverse saturation current ig associated with the 
flow of minority carriers across the CB junction; this is very small. If, however, 
the CB junction is forward biased, holes (majority carriers) in the base can 
flow into the collector (we will call this component ,-) and electrons (majority 
carriers) in the collector can flow into the base (we will call this component 
iec) (see Fig. 26-21). This double flow of charge carriers corresponds to a large 
current that is in the opposite direction as that of aig. Thus the net collector 
current will be given by 


Ic — Ql ace (inc + lec) 


The net collector current will decrease rapidly as the CB junction becomes 
increasingly forward biased (that is, as Vcp decreases below 0.6 V), until 
Vcz = 0. At this point, the emitter and the collector are at the same potential. 
This in turn implies that the CB and the EB junctions are equally forward 
biased, and therefore the net current is zero. 


26.5 FIELD-EFFECT TRANSISTORS (FET) 


One disadvantage of the bipolar junction transistor is the low resistance of 
the input circuit; in many cases this is an undesirable feature. For example, 
if the voltage drop across a resistor is used for V; in Fig. 26-19, the current 
through the resistor will change and so will the magnitude of the voltage 
across the resistor. The disadvantage of low resistance in the input circuit is 
illustrated in Problem 27.8. This difficulty is remedied by another type of 
transistor whose general name is Field-Effect Transistor (FET). There are two 
basic types of FET’s: the Junction Field-Effect Transistor (JFET) and the Metal- 
Oxide-Semiconductor Field-Effect Transistor (MOSFET). 


oe i, oe 
FIGURE 26-21 

Schematic representation of the 
flow of electrons and holes across 
the two junctions of a transistor, 
connected in the common emit- 
ter configuration, when Vz is less 
than a few tenths of a volt. 
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FET’s have two salient features: 


1. They are voltage controlled devices. The current through the device is con- 
trolled by an electric field associated with a voltage placed at an electrode 
called the gate. This feature gives them the name FET. 


2. The current is transported by carriers of one polarity only (majority car- 
riers). For this reason, this type of transistor is also called a unipolar tran- 
sistor. 


The resistance of a wire filament is given by R = p d/A (Section 23.1), 
where p = 1/0 = m/q?Nz (Eq. 23.14), N is the carrier concentration, d is the 
length of the filament, A is the cross-sectional area of the filament, and 7 is 
the time between collisions. In the FET, the resistance, and therefore the 
current for a given voltage, is controlled by an input voltage applied at the 
gate. The input voltage determines either the area A or the concentration N 
or both. In the JFET, A varies; in the MOSFET, both A and N vary. 


26.5a The Junction Field-Effect Transistor (JFET) 


A schematic of a JFET and how it can be used in a circuit is shown in Fig. 
26-22. The JFET consists of a rod of a semiconductor (for example, n type) to 
which two metallic ohmic (as opposed to rectifying) contacts, called the source 
and the drain, are made. A heavily doped p-type semiconductor region, called 
the gate, surrounds the n-type rod. 

In its normal mode of operation, the drain is maintained at a positive 
potential V, with respect to the source. The gate is maintained at a negative 
potential V, with respect to the source. The potential difference between the 
drain and the source will give rise to a current i, through the n-type rod. This 
current will produce an i,R drop along the rod. As a consequence, the potential 
along the rod varies from nearly V, for that part of the rod close to the drain 
to nearly zero for the part close to the source. The p-n junction formed where 
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FIGURE 26-22 

Structure of a Junction Field-Ef- 
fect Transistor (JFET). In the fig- 
ure the JFET is connected in se- 
ries with a load resistor R; and 
the battery V4. The potential of 
the gate is the same as (when 
V, = 0) or lower than that of the 
source (when V, # 0). 
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the gate comes in contact with the rod is reverse biased, becoming progres- 
sively more so toward the drain. As a result, near the p-n junction there is a 
depletion layer. Moreover, because the gates are very heavily doped compared 
to the rod, this depletion layer lies primarily in the n region (in the rod). 
Because the width of this layer depends on the potential difference across 
the reverse biased junction, it will be wider close to the drain than close to 
the source; that is, it will be wedge-shaped. The depletion layer effectively 
acts as an insulator; this means that i, is forced to move through a funnel- 
shaped channel in the n-type rod. Figure 26-23 shows the dependence of i, 
versus V, for several values of the gate voltage, V,. 

Let us consider the case where V, = 0. As Vy, begins to increase from 
zero, the n-type rod behaves as an ohmic device; that is, ig is proportional to 
V,. For small V,s, the depletion layer is very small and the rod acts simply 
as a resistor of constant value. However, as V, increases, the p-n junction at 
the gate becomes more and more reverse biased; this means that the depletion 
layer spreads increasingly into the body of the n-type rod and forces the 
current to flow through a channel that is becoming progressively narrower. 
This has the effect of increasing the resistance of the rod (because of the 
decrease of the effective cross-section A). Thus the current i, does not increase 
as fast as it did initially when Vz was small. Eventually, the channel becomes 
so small that the expected increase in iz due to the increase in V, is inhibited 
by the associated increase in RK due to the reduction in size of the conducting 
channel: The current levels off. When this situation occurs, the FET is said 
to be in the “pinch off’ region. 

If V, is at some negative potential to begin with, the width of the depletion 
layer for a given V, will be greater than when V, was zero. This has two 
effects: (1) the initial resistance of the rod (when V, was small) will be greater, 
which explains why the slopes of the curves decrease as V, becomes more 
and more negative; and (2) the pinch off will occur at smaller Vz and ij. 

The electronic symbol for an n-channel JFET is shown in Fig. 26-24a. A 
p-channel JFET can be constructed by making the rod of a p-type material and 
the gates of heavily doped n-type material. The polarity of Vz and V, has to 


FIGURE 26-23 
Output characteristic curves—i, 


| versus V,—for several values of 
~~ the gate voltage V, for the JFET 


circuit of Fig. 26-22. 
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FIGURE 26-24 
(a) Electronic symbol for an 
n-channel JFET like the one in 
Fig. 26-22. (b) Electronic symbol 
for a p-channel JFET. 
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be reversed. The electronic symbol for a p-channel JFET is shown in Fig. 
26-24b. 

At the beginning of our discussion, we indicated that the FET’s solved 
the problem of low input resistance. The input to the FET, whether used as 
an amplifier element or as a switch, is fed in the gate circuit. In this circuit 
there is a reverse biased p-n junction; therefore, the resistance is very large, 
typically on the order of 1017 ©. 


26.5b The Metal-Oxide-Semiconductor 
Field-Effect Transistor (MOSFET) 


Another type of FET, the metal-oxide-semiconductor field-effect transistor, is 
illustrated in Fig. 26-25. It consists of two heavily doped n-type regions dif- 
fused into a p-type substrate. The two n regions serve as the drain and the 
source. The gate electrode is a metallic layer deposited on top of an insulating 
layer of SiO2. When V, = 0, no current can flow between the drain and the 
source. The reason is that where the drain contacts the p-type substrate we 
have a reverse biased junction. If a positive potential is applied at the gate, 
electrons from the p substrate are attracted toward the SiO,-substrate inter- 
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FIGURE 26-25 
Structure of a Metal-Oxide-Semi- 


conductor Field-Effect Transistor 
(MOSFET). 


p-type substrate 
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FIGURE 26-26 
V,= +1¥ Output characteristic curves—i, 
a a Ue 7 = versus V,—for several values 
s ‘0 of the gate voltage V, for the 


Vq (V) —> MOSFET circuit of Fig. 26-25. 


face. These electrons will recombine with the holes in the region of the p type 
substrate near the interface. Eventually a small layer near the interface will 
become an n-type layer. This provides a channel for the current i, to flow 
from the drain to the source when a voltage V, is applied between them. The 
greater V., the greater the width of the channel and consequently, the greater 
ig Will be for a given Vy. This is shown in Fig. 26-26, which illustrates the 
dependence of i, on V, for different values of V.. 

In addition to the feature already mentioned (the increase of iz with V,), 
there is also a leveling off of iz after a certain Vz. The reason is the same as 
in the case of the JFET. When the n-type channel is formed, we have a p-n 
junction between the p-type substrate and the n-type channel. A depletion 
layer is formed that increases as V; becomes more and more positive because 
the n-type channel becomes positive (more so near the drain than the source) 
with respect to the p-type substrate, which is connected directly to the negative 
terminal of V,z. This depletion layer increases the resistance of the channel 
and eventually leads to the leveling off of the current ij. 

The electronic symbol for an n channel (p-type substrate) MOSFET and 
for a p-channel (n-type substrate) MOSFET are shown in Figs. 26-27a and 
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ra 
B (substrate) 


(a) 


(b) 

FIGURE 26-27 

(a) Electronic symbol for an 
n-channel (p-type substrate) 


26-27b, respectively. 


PROBLEMS 


26.1 Thecurrentthroughap-njunctionis1 x 10-°A 
when a reverse bias voltage of 10 V is applied across 
the junction at T = 300 K. What will be the current 
through the diode when a forward bias voltage of 
(a) 0.1 V, (b) 0.3 V, and (c) 0.5 V is applied? 


26.2 In the ideal diode the reverse saturation cur- 
rent should be as small as possible. Considering the 
fact that E, for Si is 1.1 eV and E, for Ge is 0.67 eV, 
which material is better suited for the fabrication of 
p-n junction diodes? | 


96.3 The reverse saturation current of a silicon diode 
is ig = 5 X 10°? A. The voltage across that diode 
when forward biased is 0.45 V. (a) What is the cur- 
rent through the diode at T = 27°C? (b) If the voltage 
across the diode is held constant, and we assume 
that ig does not change with temperature, what is 
the current through the diode at T = 47°C? 
(Answer: (a) 0.18 A, (b) 0.06 A.) 


26.4 In Problem 26.3 we assumed that the reverse 
saturation current ip remains constant when the tem- 
perature changes. (a) Show that this assumption is 


MOSFET. (b) Electronic symbol 
for a p-channel (n-type substrate) 
MOSFET. 


highly incorrect by calculating ig at T = 47°C when 
io = 5 X 10°? A at 27°C. Assume that the Fermi 
level on the p side of the junction is 1 eV below the 
bottom of the conduction band. (b) If the voltage 
across the forward biased diode is 0.45 V, as in Prob- 
lem 26.3, what is the current through the diode at 
= 47°C? 

(Answer: (a) 5.6 X 1078 A, (b) 0.67 A.) 
26.5 The reverse saturation current of a silicon diode 
doubles when the temperature changes from 27°C 
to 33°C. What is the position of the Fermi level on 
the p side of the junction? 


26.6 In the circuit of Fig. 26-28, the voltage across 
the resistor R is 4.6 V. What is the reverse saturation 
current of the silicon diode if the temperature is 27°C? 

(Answer: 1.8 x 10719 A.) 


FIGURE 26-28 


= |} —_ 
Problem 26.6. Vy =5V 
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26.7 (a) What is the voltage across the resistor R of 
Problem 26.6 if the diode is reversed, as shown in 
Fig. 26-29? (b) What is the effective resistance of the 
diode? 

(Answer: (a) 9 x 1077 V, (b) 2.8 x 10?° 2.) 


kK} 


FIGURE 26-29 Lo " 
Problem 26.7. V=5V 


26.8 A diode is connected in series with a resistor 
R = 1000 Q and a battery V = 10 V (see Fig. 26-30). 
The reverse saturation current of the diode is ip = 
10~® A. (a) Assume that the voltage Vo across the 
forward biased diode is 0.6 V, what is the current i 
in the circuit? (b) Use the diode equation, Eq. 26.8, 
and the answer found in part (a) to evaluate Vo (as- 
sume I’ = 300 K). Does the answer to (b) agree with 
the assumption in part (a)? (c) Repeat (a) and (b) for 
the following assumed values of Vo: 0.5 V, 0.4 V, 
and 0.3 V. (d) What is the approximate value of Vo? 
(Answer: (a) 9.4 x 107° A, (b) 0.356 V, 

(c) 9.5 x 1073 A, 0.356 V; 9.6 x 107° A, 0.357 V; 
9.7 x 10°° A, 0.357 V, (d) 0.357 V.) 


meV -e4 


FIGURE 26-30 4) | 
Problem 26.8. . 1 OV 


26.9 In Problem 26.8, by trying different values for 
the voltage Vo across the diode, we were able to get 
an estimate of the actual value. The exact value of 
Vo can be found by applying Kirchhoff’s loop rule 
(see Section 15-6) to the circuit of Fig. 26-30, 


OT 
10V =V, + 10°01 


because the current through the resistor is the same 
as the current through the diode, i is given by Eq. 


26.8, therefore 


kel 


This equation must be solved to find Vo at a given 
temperature. Unfortunately, this is a transcendental 
equation that cannot be solved analytically. It can 
be solved numerically with the aid of a computer. 
Different values of Vo are systematically inserted into 
the equation until a value is found for which the 
equation holds. Assume that T = 300 K and that 
the reverse saturation current is ij = 107° A. Write 
a computer program that will yield a value for Vo 
within 0.1% of the actual value. 


26.10 Use the program of Problem 26.9 to find 
the voltage across the forward biased diode of Fig. 
26-30 when the temperature rises to 320 K. The 
Fermi level in the p side of the diode is 1 eV below 
the bottom of the conduction band. Recall (see Prob- 
lem 26.4) that ig varies with temperature. 


26.11 An alternating voltage V = 120 sin at V is 
applied to the input terminals of the circuit shown 
in Fig. 26-31. Assume that the forward voltage across 
the diode is 0.5 V. (a) Sketch the input voltage as a 
function of time. (b) Sketch the output voltage (volt- 
age across R) as a function of time. 


—<—Er 


R = 5000 Vout 


FIGURE 26-31 | 
Problem 26.11. _ 


26.12 The alternating voltage of Problem 26.11 is 
applied to the input terminals of the circuit shown 
in Fig. 26-32. Sketch the output voltage Vout as a 
function of time f. 


FIGURE 26-32 Problem 26.12. 


26.13 Sketch the unbiased and the biased energy 
diagrams for a pnp transistor in the common base 
configuration. 


26.14 Draw acommon emitter circuit similar to the 
one shown in Fig. 26-19 for a pnp transistor. Indicate 
in a figure similar to Fig. 26-21 the different current 
components crossing the two junctions in the sat- 
uration region and in the active region. Assume that 
the base is very lightly doped compared with the 
emitter and the collector. 


26.15 Annpn transistor with a gain parameter B= 
10 is connected in the common base circuit shown 
in Fig. 26-33. An alternating voltage Vi, = 2 sin wt V 
is applied to the input terminals. What is the out- 
put voltage Vout? Assume that resistance of the emit- 
ter-base junction is constant and equal to 10 2. 
(Answer: 71 sin ot V.) 


\ , — 
? R = | 46 KA! \’ i! 


= 50012 
FIGURE 26-33 a -—+ | + | | = 
= 5-0¥ '> = 150V 


Problem 26.15. V) 


y 
¢—0 


26.16 Consider the data presented in Fig. 26-20. 
(a) What is the approximate value of the current gain 
parameter? (b) What fraction of the electrons in- 
jected from the emitter into the base reach the col- 


lector? 
(Answer: (a) 200, (b) 0.995.) 


26.17 A 2N2222A transistor having the character- 
istics shown in Fig. 26-20 is used in a common emit- 
ter circuit (see Fig. 26-19). The base resistor Ky = 
20 kQ,, V; = 3 V, and the voltage across the emitter- 
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base junction is 0.4 V. (a) Calculate the base current 
ip. (b) Use the current gain parameter found in Prob- 
lem 26.16 to obtain the collector current ic. (c) If 
R, = 1kQ and V2 = 35 V, what is the collector- 
emitter voltage Vcr? 

(Answer: (a) 130 wA, (b) 26 mA, (c) 9 V.) 


26.18 The width of the depletion layer on the p 
side of a p-n junction is given by 


2Key (V. — Vo) |? 


Na 
N 1+ — 
° «( x.) 


where « is the dielectric constant of the semicon- 
ductor (for Si, K = 12), €&) = 8.85 x 10°” farad/m 
is the permitivity of free space, V. is the equilibrium 
contact potential, Vp is the external bias voltage of 
the junction (taken as positive when forward biased 
and negative when reverse biased), and N, and Np 
are the impurity concentrations in the p region and 
in the n region, respectively. Calculate the width of 
the depletion layer on the p side of a junction when 
(a) Vo = 0 V, and (b) Vo = —10 V. Assume V, = 
0.6 V, Nz = 107° m~3, and Np = 107% m-?. 
(Answer: (a) 2.82 x 10~°m, (b) 11.86 x 107° m.) 


Xp = 


26.19 An npn silicon transistor with an a = 0.990 
when unbiased is used in a common emitter circuit 
similar to the one in Fig. 26-19. The width of the 
base is 30 wm. Let us assume that the number of 
electrons injected from the emitter into the base that 
recombine with holes in the base is proportional to 
the width of the “effective base” (the nondepleted 
region). (a) What will be the value of a when Vcg = 
10 V? (b) What will be the change in 6 as Vcg varies 
from 0 V to 10 V? Use the doping levels and the 
equilibrium contact potential of Problem 26.18. 
(Answer: (a) 0.993, (b) B (0 V) = 99, 
B (10 V) = 142.) 
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27.1 INTRODUCTION 


The complex logical operations performed by any digital instrument, be it a 
simple digital voltmeter or a sophisticated computer, are based on a system 
of symbolic concepts known as boolean algebra, named after George Boole, 
who began publishing the algebra of logic in 1847. In a digital instrument, 
these symbolic concepts are implemented by means of electronic circuits called 
logic circuits or gates. 

In this chapter we will present a brief introduction to some of the prin- 
ciples of boolean algebra. We will then show how the semiconductor devices, 
diodes and transistors discussed in the preceding chapter, are used to con- 
struct the gates that implement this algebra. 


27.2 RUDIMENTS OF BOOLEAN ALGEBRA 


The three basic logic operations of boolean algebra are named AND, OR, and 
NOT. 


AND Operation 


Consider two switches A and B wired in series with a battery and a load 
resistor R as in Fig. 27-1. Current will flow if both switches A and B are closed. 
These switches perform the AND operation, which in symbolic logic is rep- 
resented as 


A-B=T orsimply AB =T (27.1) 


The statement above reads: If A and B are closed, transmission will occur. 
More generally, A could represent the truth of some condition and B the 
truth of another condition. And the statement would read: If condition A is 
satisfied (is true) and at the same time condition B is satisfied, then the desired 
result will be obtained and the output will be true (T). 

It is customary to represent a condition that is TRUE by 1 and a condition 
that is NOT TRUE by 0. A way to represent all possible combinations of the 
states for all the variables involved is by means of a truth table. The truth table 
for the AND operation with two variables A and B and the result T is shown 
in Table 27-1. In this table the symbol 0 and 1 under A and B mean open and 


R 


ay = 
he a 


George Boole (1815-1864). 


FIGURE 27-1 


Implementation of the AND logic 


function with 
switches. 


two 


manual 
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closed, respectively, while these symbols under T mean not true and true, 
respectively. 


TABLE 27-1 

Truth Table for the AND 
Operation 

A B T 
0 0 0 
0 1 0 
1 0 0 
1 1 1 


OR Operation 


If two switches A and B are wired in parallel with each other, and this parallel 
combination is then connected in series with a battery and a load resistor R, 
as in Fig. 27-2, transmission will occur if either A or B is closed. The circuit 
will perform the OR operation, which in symbolic logic is represented as 


A+B=T (27.2) 


This statement means that the desired result is true if either condition A or 
condition B or both are true. The truth table for the OR operation with two 
variables A and B is shown in Table 27-2. 


TABLE 27-2 

Truth Table for the OR 
Operation 

A B a 
0 0 0 
0 1 1 
1 0 1 
1 1 1 


FIGURE 27-2 


Implementation of the OR logic 


function with 
switches. 


two 


manual 
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NOT Operation 


The NOT operation simply inverts the logic condition. It is written as A, 
which reads NOT A. Thus if A stands for “the sun will shine tomorrow,” 
then A stands for “the sun will not shine tomorrow.” As an example of a 
truth table using the NOT concept, consider the following boolean equation. 


A-B=T (27.3) 


Equation 27.3 means that if condition A is satisfied and at the same time 
condition B is not satisfied, the output will be true. The truth table for such 
an equation is given in Table 27-3. 


TABLE 27-3 7 
Truth Table for A-B = T 


A B T 
0 0 0 
0 1 0 
1 0 1 
1 1 0 


27.2a Some Theorems in Boolean Algebra 


Boolean algebra allows us to manipulate these three basic concepts in order 
to simplify statements of logic and the circuitry necessary to implement them. 
We list here some important theorems. As before, + means or and a product 
means and 


1. Commutation theorems 
A+B=B+A 


AB = BA 


2. Association theorems 
A+ (B+C)=(A+B)+C 
A(BC) = (AB)C 
3. Distribution theorems 
A + (BC) = (A + BYA + C) 
A(B + C) = AB + AC 


4. Absorption theorems 
A+ AB=A 


A(A + B)=A 
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5. DeMorgan theorems 
A+B=A°B 
AB=A+B 
These theorems can be verified by simple intuition or by writing the appro- 
priate truth tables. As an example, let us consider the second of the DeMorgan 
theorems, AB =A +B. 
Because AB stands for the inverse of AB, and the output of AB is 1 only 


when A is 1 and B is 1, the output of AB will be 0 only when A = 1 and 
B = 1. The truth table for AB = T is shown in Table 27-4. 


TABLE 27-4 | 
Truth Table for AB = T 


- 


A B 

0 0 1 
0 1 1 
1 0 1 
1 1 0 


A comparison of the truth tables for AB (Table 27-4) and for AB (Table 27-1) 
shows that the outputs are inverted. 

By the definition of the OR concept, A + B will have an output 1 if 
either A = 0 or B = 0 or both A and Bare 0. The truth table for A + B = T 
is therefore identical to that for AB = T (Table 27-4), and we conclude that 
AB = A + B. 


27.3 ELECTRONIC LOGIC CIRCUITS 


The logic concepts introduced in Section 27.2 can be implemented with elec- 
tronic circuits where the signals have two different values. For example, state 
1 could be represented by a signal of 5 V and state 0 by a signal of 0 V. The 
most commonly used signal levels are 5 V and 0 V or 10 V and 0 V, although 
the exact value of the input and output voltages are not critical, as long as 
they are clearly distinguishable. A logic circuit is one with an output signal 
that is the logical function of the inputs. These circuits are called gates. 


27.3a AND Gate 


The AND gate is one with inputs and output levels corresponding to the 
truth table 27-1 for the AND function. It may have several input terminals, 
one for each condition of A, B, and so on, and one output terminal for the 
result T. Only when all the inputs are at logic level 1 (5 V) will the output be 
a logic level 1. The electronic symbol for an AND gate with two inputs A and 
B is shown in Fig. 27-3. 


Ac 
0 Luipul 


FIGURE 27-3 
Electronic symbol for a two-in- 
put AND gate. 
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Before we consider how an AND gate can be built, let us introduce a 
convention that will be used often in this chapter. When we specify the 
potential of a point in a circuit, we must always specify the reference point, 
that is, the point whose potential we arbitrarily call zero. It is common practice 
to call this reference point the ground. In household wiring, the ground is 
obtained by making a tight connection to a water pipe made of copper. In an 
electronic circuit the ground is the metal chassis of the instrument that is 
assumed to be equipotential throughout, that is, to have no resistance. The 
electronic symbol used to represent the ground is shown in Fig. 27-4. When 
we say that an input or an output signal is 5 V, we mean that the input or 
output voltage is 5 V above ground. Similarly, an input of 0 V means that 
the input connection is at the same potential as the chassis ground; we often 
use the expression “‘it is grounded.” 

In today’s computers, the logic circuits are built with semiconductor diodes 
and transistors. In the early days of the computer era, relay switches and 
vacuum tubes were widely used in the construction of computers. Because 
they are easier to understand, we will first show how the gates can be built 
with relay switches. 

A relay switch is a remotely controlled switch. A schematic representation 
of one type of relay switch is shown in Fig. 27-5. A current through a coil 
produces a magnetic field that attracts one end of a pivoted lever toward the 
coil. A switch contact mounted on the lever provides a movable contact that 
can connect to either of two stationary contacts. When no current flows through 
the coil, a spring attached to the other end of the lever keeps the lever in the 
position shown in Fig. 27-5 and, as a result, the movable contact is connected 
to the upper stationary contact. If one end of the coil is grounded and an 
input voltage is applied to the other end, a current will be set through the 
coil. The end of the lever above the top of the coil will be attracted downward, 
and the movable contact will be connected to the lower stationary contact. 
The electronic symbol used to represent a relay switch is shown in Figs. 
27-6a and 27-6b. In Fig. 27-6a, the coil is not activated (no current); as a result, 
the movable contact is connected to the upper stationary contact. In Fig. 
27-6b, the coil is activated; consequently, the movable contact is connected to 
the lower stationary contact. We will assume that a 5-V input will produce 
the necessary current to have the coil attract the lever. 

An AND gate built with relay switches is shown in Fig. 27-7. It has two 
inputs, V4 and Vz, which represent two logic conditions A and B. The output 


Movable contact ~ 


4, 


—\, Stationary 


contacts 


=a 
FIGURE 27-4 
Electronic symbol for the ground. 


FIGURE 27-5 
Relay switch. 
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| ) FIGURE 27-6 

6 Electronic symbol for a relay 
: switch. (a) The coil is not acti- 
vated (no current through the coil) 
in = ° resulting in the connection of the 
movable contact with the upper 
stationary contact. (b) The coil is 
activated, the movable contact is 
(b) connected to the lower station- 

ary contact. 


is the potential of the movable contact of switch 2. The two upper stationary 
contacts are grounded. The lower stationary contact of switch 1 is connected 
to a 5-V source (note that this 5-V signal does not represent an input con- 
dition), while the lower stationary contact of switch 2 is connected to the 
movable contact of switch 1. Suppose Va = Vz, = 5 V. Both coils will be 
activated, the movable contacts of both switches will be connected to their 
lower stationary contact, and thus Vou. = 5 V. If Vg = 0 V, then the movable 
contact of switch 2 will be connected to its upper stationary contact, which 
is grounded; thus, regardless of whether Va = 5 V or 0 V, Vou = 0 V. When 
Va = 0 Vand Vz, = 5 V, the output through the movable contact of switch 
2 is connected to the lower contact of switch 2, which in turn is connected 
to the movable contact of switch 1, and because switch 1 is not activated, its 
movable contact is connected to its upper contact, and therefore grounded, 
Vout = 0 V. Therefore, only when both V, and Vj, are 5 V (logic level 1) will 
the output be 5 V. The circuit in Fig. 27-7 implements the AND logic function. 


Vout 


FIGURE 27-7 
A two-input AND gate made with 
Switch 1 Switch 2 relay switches. 


27.3b OR Gate 


The OR gate has a logic level 1 output when any one input or all the inputs 

are at logic level 1 (see Table 27-2). The electronic symbol used to represent 

the OR gate is shown in Fig. 27-8. An OR gate built with relay switches with ___) >— 7 
two inputs V, and Vj, is schematically represented in Fig. 27-9. The output B ato 
is again the potential of the movable contact of switch 2. If both V, and Vg FIGURE 27-8 


are 5 V, then the output is connected through the lower contact of switch 2 Electronic symbol for a two-in- 
to the 5-V source; that is, Vor = 5 V. The same is true if Vg = O V_ put OR gate. 
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and Vg = 5 V. If Va = 5 Vand Vz, = 0 V, the output is connected through 
the upper contact of switch 2 and the movable and lower contacts of switch 
1 to the 5-V source, Vou is still 5 V. Finally, if Vk = Vg = 0 V, the output 
will be connected, through the two movable contacts and the two upper 
contacts, to ground and V,,, = 0 V. Thus, if either V4 or Vg or both are 5 V 
(logic level 1), the output will also be 5 V. Only when both inputs are 0 V 
(logic level 0) will the output be 0 V. The circuit in Fig. 27-9 implements the 
OR logic function illustrated in Table 27-2. 


27.3c The Inverter 


To complete the list of circuits needed to perform the basic logic operations, 
the NOT function is implemented by an inverter. An inverter is a circuit with 
an output logic level opposite that of the input. The electronic symbol of an 
inverter is shown in Fig. 27-10. 

A relay switch circuit that implements the NOT function is shown in Fig. 
27-11. It is seen that if the relay is not activated (V, = 0 V) the movable 
contact will be connected to the upper contact and as a consequence the 
output, that is, the potential of the movable contact will be 5 V. An input of 
5 V in the relay switch will ground the output, thereby inverting the logic 
signal. 


27.4 SEMICONDUCTOR GATES 


We have implemented the different logic functions with relay switches be- 
cause they are easy to understand. The early electronic computers were built 
with relay switches. However, in modern technology various semiconductor 
devices are used for switches. Several examples follow. 


2/.4a Diode Switch 


A diode may be used as an electronic switch in the circuit shown in Fig. 
27-12a. The circuit has been redrawn in more conventional form in Fig. 
27-12b. Because, as indicated earlier, potential differences are measured with 


FIGURE 27-9 
A two-input OR gate con- 
structed with relay switches. 


Input Output 


FIGURE 27-10 

Electronic symbol for the in- 
verter, a circuit that performs the 
NOT logic function. 


FIGURE 27-11 
An inverter constructed with a 
relay switch. 


Vie Nn p 
(a) | 

FIGURE 27-12 
An electronic diode switch. (a) Abbreviated diagram of the circuit. (b) Circuit redrawn 
in more conventional form. (c) The input Vj is 5 V; this is done by connecting the 
input terminal to the positive terminal of a 5-V battery. (d) The input Va is 0 V, this 
is done by connecting the input terminal to ground. 


respect to ground, in Fig. 27-12b we have represented the + 5-V source of 
Fig. 27-12a as a battery with its positive terminal connected to point 3 and its 
negative terminal connected to ground. The output voltage will be the po- 
tential difference between the p side of the diode and ground. The input 
signal will be a voltage applied between the n side of the diode and ground. 
An input of 5 V can be represented by a battery with the positive terminal 
connected to the n side of the diode and the negative terminal to ground, as 
in Fig. 27-12c. An input of 0 V places the n side of the diode at ground potential; 
therefore, we can represent this by connecting the n side to ground, as in 
Fig. 27-12d. 

Let us consider the situation shown in Fig. 27-12c, Va = 5 V. We note 
that the potential of points 1 and 3 is the same, 5 V. Consequently, there will 
be no current from point 3 to point 1. If we assume that the voltmeter used 
to measure V,,, has a very high input resistance (ideally infinite), there will 
be no current through the resistor R. This in turn means that both sides of 
the resistor, points 2 and 3, are at the same potential (recall that the potential 
drop across a resistor is iR). Because the potential of point 3 is 5 V, Vout, the 
potential of point 2, will also be 5 V. 


Let us now consider the case Va = 0 V, as in Fig. 27-12d. In this case 


the diode is forward biased because the n side of the diode is connected 
directly to ground whereas the p side is connected through the resistor R to 
the 5-V source. The forward biased diode will conduct and, as we saw in 
Section 26.3c, the voltage across the diode (between point 2 and the grounded 
point 1) will be a few tenths of a volt, 0.6 V or less is a typical value for a 
silicon diode. Thus, when Va = 0 V, Vout (the potential of point 2) will be 
approximately 0. Note that now most of the 5-V drop between points 3 and 
1 occurs across resistor R, not across the diode. | 

The circuit of Fig. 27-12a is an electronic switch because an input signal 
of 5 V turns the output “on” (5 V), whereas an input of 0 V turns the output 
“off” (approximately 0 V). 


SEMICONDUCTOR GATES 
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27.4b Diode AND Gate 


A diode AND gate with two inputs V, and Vx to represent two logic conditions 
can be constructed with two diode switches like the one discussed in the 
previous section, see Fig. 27-13a. The circuit consists of two diode switches 
connected in parallel with the resistor R and the 5-V source common to both. 
This is readily seen if we redraw the circuit in more conventional form as in 
Fig. 27-13b. As we showed in the preceding section, a particular diode will 
conduct (will be forward biased) if its corresponding input is grounded, that 
is, 0 V, and will not conduct if the input is 5 V. Thus, when Va = Vz = 
5 V (Fig. 27-13c), neither diode will conduct; as a consequence there will be 
no current through the resistor R and this in turn means that both sides of 
R, points 3 and 2, are at the same potential. Because the potential of point 3 
is 5 V, the output Vout, which is the potential of point 2, will also be 5 V, that 
is, Vout = 5 V. Thus when both inputs are at logic level 1, the output will 
also be at logic level 1. If V, = 0 V and Vg = 5 V (Fig. 27-13d), diode A will 
be forward biased and will conduct. As a result, the potential of the p side 


(c) 
FIGURE 27-13 

Diode AND gate. (a) Abbreviated diagram of the circuit. (b) Circuit redrawn in more 
conventional form. (c) The inputs to both diodes are connected to the positive terminal 
of 5-V batteries to represent the condition V, = 5 V, Vg = 5 V. (d) The input to diode 
B is connected to the positive terminal of a 5-V battery, whereas the input to diode 
A is grounded to represent the condition Vg = 5 V, Va = 0 V. (e) Both inputs are 
grounded, Va = 0 V, Vg = OV. 


of diode A (point 2) will be 0.6 V or less higher than the n side of diode A, 
which is grounded. V,,,, will be approximately 0 V. Note that the p side of 
diode B is connected to the p side of diode A; therefore its potential will also 
be 0.6 V above ground; because the potential of the n side of diode B is 5 V, 
we see that diode B is reverse biased and therefore does not conduct. Similar 
conclusions can be drawn if Va = 5 V and Vg = 0 V. Diodes A and B reverse 
roles, but the output voltage will still be the voltage of the forward biased 
diode, diode B in this case—that is, Vou ~ 0 V. Finally, if Va = Vg = 0V 
(Fig. 27-13e), both diodes will be forward biased and V,y. ~ 0.6 V. Thus, if 
either or both inputs are at logic level 0, the output will be at logic level 0. 
The circuit in Fig. 27-13 implements the AND logic function. 


27.4c Diode OR Gate 


The circuit of two diodes in Fig. 27-13 can be rearranged to create an OR gate. 
Consider the circuit in Fig. 27-14. The circuit has two inputs, V, and V3, to 
represent two logic conditions. Note that now the input signals are applied 
to the p side of the diodes and the output is the potential of the n side of the 
diodes. The 5-V source has been removed, and the load resistor is connected 
directly to ground. 

lf Va = Vg = 5 V (Fig. 27-14b), the diodes will be forward biased because 
their p side will be directly connected to the positive terminal of the input 
sources and therefore will be at a higher potential than the n side. Being 
forward biased, the potential difference across the diodes will be approxi- 
mately 0.6 V. Because the potential of the p side is 5 V, the potential of the 
n side will be 5 V — 0.6 V = 4.4 V. That is, Vou: ~ 5 V or logic level 1. If 
Va = 5 V and Vz = 0 V (Fig. 27-14c), diode A will be forward biased and 
for the same reason as before the potential of its n side will be approximately 
5 V (4.4 V); that is, the output will be at logic level 1. We should note 
that diode B is now reverse biased because the potential of the 1 side is 
4.4 V whereas the p side is grounded. Similar conclusions can be drawn if 
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(b) (c) 

FIGURE 27-14 

Diode OR gate. (a) Abbreviated diagram of the circuit. (b) The inputs to both diodes 
have been connected to the positive terminal of 5-V batteries to represent the condition 
Va = 5 V, Vs = 5 V. (c) The input to diode A is connected to the positive terminal 
of a5 V battery, whereas the input to diode B is grounded to represent the condition 
Va = 5 V, Vs = 0 V. (d) Both inputs are grounded, and therefore V, = Vg = 0 V. 
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Va = 0 V and Vz = 5 V. Thus if either or both inputs are at logic level 1, 
the output will also be at logic level 1. If Va = Vg = 0 V (Fig. 27-14d), there 
will be no current through any part of the circuit because there are no sources 
of potential differences in the circuit. If there is no current through the resistor, 
both sides of R must be at the same potential. One side of R is grounded; 
therefore the other side that is connected to the output will also be at ground 
potential, Vou. = O V. The circuit in Fig. 27-14 implements the OR logic 
function. 


2/.4da The Inverter 


As discussed in Section 27.3c and shown in Fig. 27-11, an inverter can be 
used as a NOT circuit. The transistor can replace the switch to make such a 
circuit. Figure 27-15a shows a diagram in which the transistor is wired in its 
common emitter configuration considered in Section 26.4c. Vcg represents 
the voltage between the collector and the emitter, that is, the potential drop 
across the transistor. Figure 27-15b gives the current-voltage characteristics 
for that configuration. The curves in Fig. 27-15) give the current through the 
collector, ic, as a function of the voltage between the collector and the emitter, 
Vce, for different values of the base current ig. 

Suppose that V2 and Rc are fixed. Using Kirchhoff’s second rule (Section 
15.6), we see that 

V2 = Relc + Vee 

which can be written as 


ic = R. Re (27.4) 
A plot of ic versus Vcr gives the straight line in Fig. 27-15b, called the load 
line, whose slope is determined by Rc. We can plot the load line by locating 
two points, for example, the intercepts of the line with vertical and horizontal 
axes. The vertical axis intercept is obtained by setting Vcg = 0 in Eq. 27.4; it 
corresponds to ic = V2/Rc. The horizontal intercept corresponds to ic = 0, 
and from Eq. 27.4 occurs when Vcg = V2. In Fig. 27-15b we have chosen 
V2 = 5 V. For a given V2 and Rc the values of ic and Vcg must satisfy the 
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ic: (mA) ——pe- 


(a) (b) 


FIGURE 27-15 

(a) An npn transistor in a com- 
mon-emitter configuration cir- 
cuit. (b) The output characteristic 
curves—ic versus Vc;—for dif- 
ferent values of ig for the com- 
mon-emitter circuit of (a). The 
load line corresponds to V2 = 
5 V and Ro = 1350 0. 


load line equation, that is, the value of ic and corresponding value of Vcg 
must be such that the point lies on the load line. At the same time, for a 
given ig the relation between ic and Vcg is given by the characteristic curve 
corresponding to that value of ig. This means that in a given situation (a given 
V>, Rc, and ig), the actual value of ic and Vczg is given by the intercept of the 
load line and the appropriate characteristic curve. Thus, by changing the value 
of ig (which is controlled by V; and Rg,), from 0 wA to 2 20 wA, Vcr can be 
made to switch from V> (5 V) to ~ 0 V. This is seen in Fig. 27-15b, where the 
load line intersects the ic — Vcg curve at 5 V when ig = O and at Vcg = 0.3 V 
(~ 0 V) when ig = 20 pA. In Fig. 27-16, the circuit of Fig. 27-15a has been 
redrawn symbolically. Vout is the collector-emitter voltage difference Vcg, 
whereas V;,, corresponds to V; in the preceding diagram. Let us assume that 
Rs, is such that when V,, = 5 V, ig = 20 MA. If Vin = 0 V, ig = 0 wA and the 
intercept of the load line of Fig. 27-15b and the appropriate characteristic curve 
corresponds to Vout (Vcr) ~ 5 V. Thus, if the input is at logic level 0, the 
output will be at logic level 1. If Vin = 5 V, t3 2 20 wA and the intercept of 
the load line and the characteristic curve corresponds to Vou. ~ 0 V. Thus, if 
the input logic level is 1, the output logic level is 0. This circuit performs the 
NOT logic function. It should be noted that because Rg controls the base 
current and Rc determines the slope of the load line, they must be properly 
chosen so that the two logic levels be properly distinguishable. That is, the 
intercept of the load line and the appropriate characteristic curve when 
Vin = 5 V must correspond to a Vcg close to 0 V. 


27.5 NAND AND NOR GATES 


Once we have the three basic logic circuits, we can combine them to form 
other more sophisticated ones. Two gates that are often used in integrated 
circuits (IC) are the NAND and the NOR gates. 

If we take the output signal from the diode AND gate of Fig. 27-13a and 
connect it to the input of the transistor inverter circuit of Fig. 27-16, we will 
have a final output that is the inverse of the simple AND gate output. The 
resulting circuit is one of the family of diode transistor logic (DTL) gates. This 
type of gate will implement the function T = AB, that is, NOT-AND (NAND). 
The electronic symbol for a NAND gate is shown in Fig. 27-17a; the actual 
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NAND gate 
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FIGURE 27-16 

Symbolic schematic of the circuit 
in Fig. 27-15a. The output volt- 
age Vou. is the inverse of the 
input voltage V;,; that is, when 
Vin = OV, Vou = 5 Vand when 
Vin = 5 V, Vou = OV. The cir- 
cuit implements the NOT logic 
function. 


FIGURE 27-17 

(a) Electronic symbol of a NOT- 
AND or NAND gate. (b) A DTL 
circuit that implements the 
NAND logic function. 
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circuit is illustrated in Fig. 27-17b. It is seen that the left side of the circuit in 
Fig. 27-17b, the section containing V2, the resistor R, the two diodes, and the 
two inputs V, and Vj, is the diode AND circuit of Fig. 27-13a. The output of 
that section is AB and appears at point 2. This output becomes the input for 
the right side of the circuit, the section consisting of V2, the resistors Rc and 
Rg, and the transistor. This section is the transistor inverter of Fig. 27-16. 
Thus, whatever input (in this case AB) is fed into the base of the transistor 
through the base resistor Rg appears inverted at the collector of the transistor; 
that is, the final output Vou: (Vcr) = AB. 

A DTL gate that performs the NOT-OR, or NOR operation, can be 
constructed by connecting the output of the diode OR gate of Fig. 27-14a to 
the input of the transistor inverter, Fig. 27-16. The final output will be 
T = A + B (see Problem 27.3). 


27.6 OTHER GATES, RTL AND TTL 


Another family of gates often found in an IC is the resistor transistor logic (RTL) 
gate. Figure 27-18a is an example of such a gate, which implements the NOT- 
OR or NOR function. The electronic symbol for a NOR gate is shown in Fig. 
27-18b. The circuit of Fig. 27-18a is essentially two inverters (Fig. 27-16) wired 
in parallel with V2. and the collector resistor Rc common to both. In Fig. 
27-16 we had only one input, which we called V;,. We now have two inputs 
V, and Vz, and each affects the output of its transistor in the same way as 
Vin did in the inverter circuit. Suppose the V4 = 5 V and Vz = 0 V. Transistor 
A will conduct, that is, ig, will be high and therefore ic will be high (see Fig. 
27-15b). Consequently, the collector of A will be at approximately 0 V with 
respect to ground (Vcg ~ 0 V) and Vou will be 0 V. A similar output will be 
obtained if V, = 0 Vand Vz, = 5 V orif both Va = Vg = 5 V. Thus, if either 
or both inputs are at logic level 1, the output will be at logic level 0. If, however, 
inputs V, = Vg = OV, then neither of the two transistors will conduct (ic = 
0) and both collectors will be at 5 V with respect to their emitters, which are 


(a@) 
(b) 


FIGURE 27-18 

(a) An RTL circuit that imple- 
ments the NOR logic function. 
(b) Electronic symbol for the NOT- 
OR or NOR gate. 


R)< 


grounded; Vou = 5 V. Thus, the circuit in Fig. 27-184 implements the NOR 
logic function. 

The advent of new and sophisticated fabrication techniques has allowed 
the construction and interconnection of hundreds of diodes, transistors, re- 
sistors, and such in a single silicon wafer a few square millimeters in area. 
The advantages of these integrated circuits are many; reduction in size and 
cost, higher speed (due in part to shorter leads), and improved circuitry; 
many more components per function can be used than would be practical 
with discrete components. One such example is the transistor-transistor logic 
(TTL) gate shown in Fig. 27-19. The input transistor Q1 is an npn transistor 
with several emitters. Transistors with eight or more input emitters are not 
unusual nor difficult to manufacture in IC form. If any of the inputs of Q1 is 
grounded (logic level 0), the base-emitter junction of Q1 will be forward biased 
because the n-type emitter is at ground potential whereas the p-type base is 
connected via R; to the + 5-V source and, therefore, it is more positive than 
the emitter. This produces a large base current ig; that turns Q1 on (see Fig. 
27-15b). As a result, the collector of Q1 becomes a virtual ground, that is, the 
voltage between the collector and the emitter of Q1 will be ~ 0 V. This in 
turn will cut off the output transistor Q2 because no current flows in its emitter 
base junction. With Q2 cut off, its collector emitter voltage becomes 5 V, that 
is, Vou. = 5 V. Thus, if any of the inputs are at logic level 0, the output will 
be at logic level 1. 

If all the inputs of Q1 are at 5 V, its base emitter will be reverse biased, 
and consequently no current can flow from the base of Q1 to its emitter. 
Current can flow through R, and through the forward biased base collector 
junction of Q1 into the base of Q2. This will turn Q2 on and make its collector 
emitter voltage ~ 0 V and, therefore, Vout ~ 0 V. Thus, when all the inputs 
are at logic level 1, the output will be a logic level 0. This is logically a NAND 
gate. 

TTL gates have many advantages over RTL and DTL gates. With some 
simple modifications the fan-out’ of this TTL gate can be as great as 15. (For 
RTL gates it is typically 5 and for DTL gates it is 8). The propagation delay for 


‘Fan-out: The number of identical logic gates that can be connected to the output of a gate without 
impairing the output logic level. 

*Propagation delay: The time interval between the introduction of a change in the logic level at 
the input and the appearance of the change at the output. 
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FIGURE 27-19 

Circuit diagram of an integrated 
circuit (IC) TTL gate that imple- 
ments the NAND logic function. 
Transistor Q1 is the input tran- 
sistor that is built with several 
emitters all having the same base 
and collector. The inputs are ap- 
plied at the emitters of Q1. 
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Circuit Function 


NOT 


AND 


NOR 


a TTL gate is about 10 nsec (10 x 10~? sec) as compared with 25 nsec for the 
DTL and the RTL gates. The noise immunity’ is also better for the TTL gates. 
A detailed explanation of these characteristics for the gates discussed here 
can be found in any digital electronics textbook. 

In the preceding sections, we have introduced gates that permit the 
implementation of the various logic functions of boolean algebra. Although 
it may seem unnecessary, the needed logic functions are usually obtained 
with just one type of gate. The reason is that by doing so, the speed of signal 
propagation, the fan-out, and the noise immunity are well known and uniform 
throughout the entire digital system; Fig. 27-20 shows how the different logic 
functions can be implemented with NAND gates. We recall that the NAND 
logic function is the inverse of the AND, and therefore the truth table has an 
output that is the opposite of the one in Table 27-1. In particular, when both 
inputs are 0, the output will be 1 and, when they are both 1, the output will 
be 0. Thus, if we take a two-input NAND gate and interconnect the two 
inputs to make a single input it becomes an inverter: a 0 applied at the common 
input will yield an output of 1, whereas a 1 input will yield a 0 output. 


3Noise immunity: A parameter that refers to the ability of a gate to stay in a given logic state in 
the presence of fluctuations (noise) in the input signal. These fluctuations can be caused by 
variations in the DC power supply, thermal voltages, or pick-up from adjacent lines where the 
current may be varying rapidly. A large noise immunity (also called noise margin) is a desired 
property for a gate, for it means that the output logic level will not change in the presence of 
large fluctuations in the input logic level. 


FIGURE 27-20 
Implementation of several logic 
functions with NAND gates. 


27.7 MEMORY CIRCUITS 


The gates that we have discussed can be used to implement the desired 
boolean functions. Two other elements are needed in the operation of a 
computer; memory circuits to store information, and clocks to drive the com- 
puter, that is, to tell the different parts when to perform a given operation. 
Both these elements can be constructed with the basic gates using feedback; 
in other words, part of the output is fed back into the input. The general term 
used for these feedback circuits is multivibrator. We will first discuss memory 
circuits. 

The basic memory circuit is called the flip-flop. A positive signal feedback 
is used, resulting in two stable output states that can represent the two logic 
levels 1 and 0. The circuit is placed in either state and remains in that state 
indefinitely until the input is changed; it stores a binary bit (a 1 or a 0) indef- 
initely. There are many different flip-flops. We will discuss two basic ones, 
the reset-set flip-flop or simply the RS flip-flop, also called the latch, and the data 


flip-flop or simply the D flip-flop. 


27.7a RS Flip-Flop or Latch 

Consider two NAND gates connected as shown in Fig. 27-21. Keep in mind 
that a NAND gate gives a logic level 1 output if any or all inputs are at logic 
level 0. It gives a 0 level output only when all inputs are at level 1. 

The latch has the peculiarity that if a signal 1 is applied to both the “set” 
and the “‘reset’’ inputs, the resulting outputs are ambiguous. This is of no 
consequence to its operation as a memory circuit, as we will see. 

Suppose that initially both inputs are at level 1 and then the “set” input 
is changed to level 0. This will result in Q = 1 and Q = 0. If the “set” input 
is changed back to 1, the outputs will not change. The reason is that the other 
input to gate 1, Fig. 27-21, is at level 0 when the change in the “‘set’”” input 
is made. The output of gate 1 (Q) will remain at level 1. If now a 0 V pulse 
is applied to the “reset,” Q = 0 and Q = 1. The outputs will remain un- 
changed when the “reset” input is changed back to 1 because the other input 
to gate 2 is at level 0 when the change is made. Thus the circuit “remembers” 
which input had the latest momentary 0-V pulse applied to its input. 


27.7b The D Flip-Fiop 


A type of memory circuit without a need to keep both inputs activated, and 
thereby a more flexible one, can be obtained with a slight modification of the 
latch. Figure 27-22 shows the data or D flip-flop. Note that the output stage, 
gates 3 and 4, is the latch discussed in Section 27.7a. 

In the D flip-flop, the data to be stored is fed only to the D input. The 
T input controls the receptiveness of the circuit to the data being fed into the 
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FIGURE 27-21 
A memory circuit called the RS 


(reset-set) flip-flop (sometimes 
referred to as the latch) con- 
structed with NAND gates. 
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D input. If T = 1, the circuit stores in the Q output the data fed at D; if T = 
0, the circuit ignores the input at D and remains in its previous state. 

Let us assume that a 1 input is fed at D while the level input of T is 1. 
The output of gate 1 will be 0; because this output becomes the input for gate 
2 and gate 3, their outputs will become 1. As we saw when we discussed the 
latch, the Q output will be a logic level 1. Moreover, because the outputs of 
gate 2 and gate 3 become inputs for gate 4, the output Q will be 0. If the T 
input is changed to 0 and then the D input is removed (that is, changed to 
0), the outputs will remain unchanged. The only way to change the output 
Q to logic level 0 is to feed a 0 input at D while T is a logic level 1. 

Thus the D flip-flop stores at Q the data fed into the D input when T = 1 
and keeps it until new data is fed. 


27.8 CLOCK CIRCUITS 


A clock is a circuit that produces periodic pulses. These pulses can be used 
to drive the different gates in the digital instrument. A clock can be constructed 
with a multivibrator whose output states are unstable and as a consequence 
has an output signal that changes periodically between high and low. The 
instability is achieved by using negative feedback. We will now discuss two 
such clocks. 


27.8a Astable Transistor Multivibrator 


Consider the circuit shown in Fig. 27-23. Initially, when the power is turned 
on, both transistors, Q1 and Q2, will tend to conduct. If however, there is 


FIGURE 27-22 
A memory circuit called the data 
flip-flop constructed with four 
NAND gates. 


FIGURE 27-23 

Circuit diagram of the astable 
transistor multivibrator. The 
output V.u4 is a voltage that al- 
ternates between high (5 V) and 
low (0 V). The circuit is therefore 
a clock. 


some type of unbalance, one will be turned on first, and as a consequence 
the other will be turned off. Suppose that Q1 is turned on first. While this is 
taking place, capacitor C; was being charged to some voltage from the +5 V 
source (via Rc, and Q2). As soon as Q1 is fully turned on, point O, becomes 
an effective ground (its voltage is about 0 V). This means that point A; is at 
a negative voltage with respect to ground. As a result the p-type base of Q2, 
which is connected to point A;, will be at a lower potential than the n-type 
emitter of Q2, which is grounded. The base-emitter junction of Q2 is reverse 
biased, and therefore ig for Q2 will be zero. This will turn Q2 off. Although 
Q2 is being turned off, Vou: is not yet at 5 V because current can flow through 
Rez, Cz, and Q1; eventually C, will be charged and Vou will become 5 V. 

The negative voltage at A, kept Q2 off; however, this does not last. While 
Q2 was being turned off, C, was discharging (via R,; and Q1) as a result of 
point O, having become a virtual ground. When C;, is fully discharged, it 
begins to charge in the opposite direction (via the same path), and when A, 
becomes positive Q2 begins to conduct because now the base-emitter junction 
of Q2 becomes forward biased. As a result, V,4, drops immediately to 0 V. 
Because C, has been charged and point O2 is now a virtual ground, A> is now 
at a negative potential with respect to ground. This turns Q1 off, although 
the potential of point O, does not rise immediately to 5 V; it does so expo- 
nentially until C, is charged. At the same time, while C, was being charged, 
C, was discharging (via Ro and Q2). When C; is fully discharged, it begins to 
charge in the opposite direction, and when Az becomes positive, Q1 is turned 
on, making O, a virtual ground; A, becomes now negative relative to ground 
and turns Q2 off. Vout then rises exponentially to 5 V while C, is being charged. 
The cycle is completed. 


27.8b IC Free-Running Oscillator Clock 


A circuit called the free-running oscillator clock made with two inverters is 
shown in Fig. 27-24. Suppose that initially V, = 0 V, V, will then be 5 V, and 
Vz = 0 V. As a consequence, the capacitor C will begin to charge (via R), see 
Fig. 27-25. This charging will continue until the voltage at point a (the input 
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FIGURE 27-24 
A clock, called the free-running oscillator, constructed with two inverters, a capacitor, 
and a resistor. The output voltage oscillates between high and low. The frequency of 
oscillation is determined by the values of the capacitor C and the resistor R. 
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Johann von Neumann (1903-1957) 
is considered the father of the 
modern computer for his devel- 
opment in 1946 of the architec- 
tural concepts of the computer 
mainframe. 


FIGURE 27-25 
Charging of the capacitor C in 
the circuit of Fig. 27-24. 
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of the first inverter) has risen sufficiently to cause the output of that inverter 
to change to low, that is, 0 V. Now V, = 0 V and V, = 5 V. The capacitor 
will now discharge (see Fig. 27-26) until the voltage at point a has dropped 
enough to cause the output of the first inverter to change back to high, that 
is, V, = 5 V and consequently Vz = O V. The cycle begins anew. The RC 
circuit was derived in Section 15.9, and it is seen by the result that the time 
to achieve a predetermined voltage depends on the values of R and C. There- 


FIGURE 27-26 
When the capacitor in Fig. 27-25 
has been charged, the potential 
of point a becomes 5 V. This 
causes the potential of point b in 


fore, the cycle of the clock can be adjusted by these choices. 


PROBLEMS 


Fig. 27-24 to become 0 V and that 
of point d5 V. The capacitor now 
discharges and then begins to 
charge in the opposite direction. 


27.1 Make the appropriate truth tables to prove the 
following distributive law of boolean algebra: 


A (B + C) = AB+ AC 


27.2 Prove DeMorgan theorem A + B = AB by 
making the truth table for both sides of the equation; 
that is, make a truth table for A + B = T and for 
A B = T and show that the result T is the same in 
both cases. 


27.3 Combine the circuits shown in Figs. 27-14a 
and 27-16 to create a DTL gate that will perform the 
logic function NOR. Make up a truth table for the 
possible combinations of the two inputs V, and Vz 
and the resulting output. Present the reasons for 
your results. 


27.4 The inputs shown in Fig. 27-27 are fed into 
the circuit of Fig. 27-18. Draw the output waveform. 


FIGURE 27-27 > fe ae 
Problem 27.4. ¢ (usec) ——> 


27.5 Figure 27-28 shows the MC846, a commercial 
DTL gate. Analyze the circuit and make a truth table 
for the possible combinations of the two inputs V, 


and Vz. What logic function does the circuit per- 
form? 


(Answer: NAND.) 
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FIGURE 27-28 Problem 27.5. 


27.6 Analyze the circuit shown in Fig. 27-29. De- 
termine the logic function performed by the circuit 
by making and justifying the appropriate truth table. 

(Answer: AND.) 


=o | r 


FIGURE 27-29 Problem 27.6. 


27.7 In Section 27-6 we showed that the different 
logic functions could be implemented with NAND 
gates only. Show, using NOR gates only, how to 
implement the AND, the OR, the NOT, and the 
NAND logic functions. 


27.8 An important parameter of a gate is its fan- 
out, that is, the number of identical gates that can 
be connected to the output of the gate without im- 
paring the output logic level. Consider the inverter 
of Fig. 27-16. We saw that if Vi, = 0 V, tp will be 
zero, and this in turn leads to ic = 0; because no 
current flows through Rc, there is no potential drop 
across R-; therefore V,.. = 5 V. Suppose that the 
output of such an inverter is connected to the input 
of a similar inverter as in Fig. 27-30. (a) What is the 
output voltage of the first inverter (potential of point 
A) when its input V;, = 0 V? Take the potential 
across the forward biased junction between the base 
and emitter of Q2 to be 0.4 V. (b) Suppose a third 
inverter, identical to the two in Fig. 27-30, is con- 
nected to the output of the first (point A), what will 
be the potential of point A? (c) What is the fan-out 
of the inverters in Fig. 27-30? 

(Answer: (a) 4.08 V, (b) 3.47 V, (c) 3.) 


FIGURE 27-30 Problem 27.8. 


27.9 (a) Find the truth table for the circuit shown 
in Fig. 27-31. What logic function does the circuit 
perform? (b) What logic function will the circuit per- 
form if the constant +5 V input to the first two gates 


is changed to ground potential? 
(Answer: (a) OR, (b) None.) 
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FIGURE 27-31 
Problem 27.9. VB° 


27.10 (a) Find the truth table for the circuit of Fig. 
27-32. What logic function does the circuit perform? 
(b) What logic function will the circuit perform if the 
common grounded input to the first two NOR gates 
is changed to +5 V? 

(Answer: (a) AND, (b) None.) 


FIGURE 27-32, 


* 
Problem 27.10. © J >! 


27.11 The circuit of Fig. 27-33 is a latch memory 
circuit made with NOR gates. When the circuit is 
not active, both inputs, the 5S (set) and R (reset), are 
kept at logic level 0. Show that when a 1 input is 
fed at S, the Q output becomes 0 and remains 0 when 
the S input returns to 0. Similarly, when a 1 input 
is fed at R, the Q output becomes 0 and remains 0 
when the R input returns to 0. Thus, the circuit 
remembers which input was last at level 1. 
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27.12 The circuit of Fig. 27-34 is a data flip-flop 
made with NOR gates. Show that the circuit stores 
at Q the input fed at the D input if the T input is 0 
and ignores the input at D if the T input is 1. 


FIGURE 27-33 
Problem 27.11. 
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FIGURE 27-34 Problem 27.12. 


27.13 In the binary system of counting, only two 
digits, 0 and 1, are used as opposed to 10 digits in 
the decimal system. The addition of two binary num- 
bers is performed in a manner analogous to that of 
the decimal system. Thus, the sum of 0 and 1 is 1; 
the sum of 1 and 1 is 0 and we carry 1, just as the 
sum of 9 and 1 in the decimal system is 0 and we 


carry 1. The truth table for the addition of two binary 
numbers A and B with the possible combinations of 
A and B and the resulting sum 5 and carry C is 


A 
0 
1 
0 
1 


=a OOo O&O a) 


BIS 
0 | 0 
0 | 1 
1 | 1 
110 


From the truth table, we can see that logically the 
sum S = AB + AB and the carry C = AB. Drawa 
circuit, made up exclusively of NAND gates, that 
will perform the addition of two binary numbers A 
and B. Indicate the output at each step of the circuit. 


27.14 Repeat Problem 27.13 with NOR gates. 
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28.1 INTRODUCTION 


Many of the technological achievements of the past two or three decades have 
been based on microelectronics. Microelectronic devices are at the heart of 
new products ranging from communication satellites to computers and space 
shuttles, to name but a few. The revolution in the electronic technology 
brought about by microelectronics has its roots in metallurgical techniques 
whose development started in the early 1950s. 

In this chapter, we will review the methods used to produce the high 
purity, single crystal wafers (slices) of silicon on which integrated circuits (ICs) 
are made. We will also consider the techniques employed to fabricate ICs. 
The use of these techniques will be illustrated at the end of the chapter when 
we describe the step-by-step production of a simple integrated NOR gate. 
Because silicon is the basic material for computers at the present time, we 
will confine our discussion to the purification and preparation of silicon semi- 
conductors, although other semiconductor materials are being developed. 


28.2 SEMICONDUCTOR PURIFICATION: ZONE REFINING 


The basic element silicon used in the fabrication of semiconductor devices is 
obtained from the chemical decomposition of compounds such as SiO, (the 
main constitutent of ordinary sand), SiHCl;, and SiCl,. By means of different 
chemical reactions the Si is chemically prepared with impurity concentrations 
of about one part per million. The chemically purified silicon is then melted 
and cast into ingots. The resulting ingot is polycrystalline in nature, that is, 
it consists of a large number of small (typically a few microns) single crystals 
having random orientations with respect to one another. To obtain device- 
grade semiconductor single crystals, the impurity concentration must be re- 
duced by several orders of magnitude and the polycrystalline ingots must be 
transformed into large single crystals. The stringent purity requirement is 
achieved by the method known as zone refining. It consists of moving a molten 
zone through the ingots. The molten zone sweeps the impurities along with 
it and thus, after several sweeps of the molten zone in one direction, most 
of the impurities are brought to one end of the ingot. 

One familiar example illustrates the principle on which zone refining is 
based. Let us consider a solution of salt in water. As the temperature of the 
solution is lowered and we reach the freezing point, ice crystals begin to form. 
The first ice crystals formed contain less salt (in fact, almost no salt) than the 
initial solution. As a consequence, the salt concentration of the remaining 
solution increases and concomitantly its freezing point is lowered. These facts 
are not unique to salt and water. The behavior of solutions, their temperature- 
concentration characteristics, are presented in phase diagrams. One hypo- 
thetical diagram is shown in Fig. 28-1. 

Referring to Fig. 28-1, let us start with liquid silicon at a temperature T, 
with an impurity concentration C,. The temperature is lowered to T>. At this 


P| 
' Liquid pnase 
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Solidus line 
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Impurity concentration——» 


temperature, silicon can exist as a solid with a maximum impurity concen- 
tration of C2, shown by the intersection of the dashed temperature line with 
the solidus line. Consequently, the silicon that crystallizes first will have an 
impurity concentration C, whereas the remaining liquid, because the impur- 
ities are still present in the total sample, will have a higher impurity concen- 
tration than before, such as C3. No further crystallization will take place until 
the temperature of the liquid is lowered to T3, when crystals with impurity 
concentration C, are formed, and so on. Zone refining exploits this redistri- 
bution of impurities between the solid and liquid phases at the freezing point. 
Figure 28-2 shows one of the zone refining process arrangements. 

The Si ingot is placed in a chemically stable, nonmelting boat and is 
pulled slowly through a furnace. A narrow region of the furnace is maintained 
at a temperature above the melting point of Si by means of a heating coil. 
Thus, a narrow molten zone is created in the ingot and, as the boat moves 
along the furnace, the molten zone moves along the ingot. The moving zone 
leaves behind it relatively pure solid silicon, whereas the impurities remain 
in the melted region, or zone. Repeated passes of the molten zone in one 
direction will result in an ingot with one end highly purified. The impure 
end of the ingot is then cut off. In today’s commercial zone refiners, the boat 
is kept stationary inside the furnace and moving molten zones are produced 
by moving a series of heating coils along the walls of the furnace. 


28.3 SINGLE-CRYSTAL GROWTH 


The production of even the simplest of ICs is complex, time consuming, and 
prone to defects. Despite this, highly sophisticated ICs can be produced at 
reasonable cost. One of the main reasons is that methods have been developed 
in the last few decades for growing large single crystals of Si, thereby per- 
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FIGURE 26-1 
Phase diagram showing why im- 


- purities segregate from a solu- 


tion on freezing. 


Furnace walls Molten zone 


Heating coil Refractory 
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FIGURE 28-2 

Schematic of the arrangement for 
zone refining. As the boat is 
moved to the left, the molten zone 
sweeps with it the impurities to 
the right end of the boat. 
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FIGURE 28-3 
Large single crystals of silicon grown by the Czochralski method of Fig. 28-4. 


mitting mass production. Samples 50 cm long and 10 cm in diameter are not 
unusual. Figure 28-3 shows two single-crystal ingots approximately 70 cm 
long and 12 cm in diameter. From such ingots, hundreds of wafers can be 
cut. In each of these wafers, 10 cm in diameter, hundreds of identical inte- 
grated circuits are formed using the techniques described later in this chapter. 
By processing several wafers simultaneously, the cost of the individual IC is 
minimized. We will now discuss some of the methods used for single-crystal 
growth. 


28.3a The Czochralski Method 


One of the techniques commonly used to grow single-crystal silicon consists 
in dipping a small seed crystal of Si into a crucible containing the molten 
silicon. The seed is then slowly raised from the melt. As the seed is lifted 
away from the melt a single crystal grows continuously onto the seed. A 
schematic of this method, known as the Czochralski method (named for the 
inventor), is shown in Fig. 28-4. 

To average any variations in the temperature of the melt and thus ensure 
uniformity in the growing crystal, the crystal is rotated slowly (a few revo- 
lutions per minute) as it is being raised. The conditions needed for good 
single-crystal growth are initially determined by trial and error. For example, 
the rate at which the growing crystal is raised is a crucial factor, and a few 


_ Growing 
crystal 


FIGURE 28-4 
Schematic diagram of the Czo- 
chralski method for growing sin- 
gle crystals. 


millimeters per hour is a fairly typical rate. Once these growing conditions 
are determined, they can be readily reproduced. 


28.3b The Bridgman-Stockbarger Method 


Another method used for growing single crystals, called the Bridgman-Stock- 
barger method, is shown in Fig. 28-5. The material is placed in a crucible 
having a conical tip. The sample, which is initially polycrystalline, is first 
melted in an upper furnace that is maintained a few degrees above the melting 
point. The crucible is then slowly lowered into a second furnace with a tem- 
perature a few degrees below the melting point. As the crucible enters this 
lower furnace, a small crystal is formed at the tip of the crucible. This crystal 
acts then as the seed for the rest of the melt and, as the crucible continues 
to be lowered, a single crystal is grown. 


28.3c Floating Zone Method 


The two methods just described have the disadvantage that the melt tends 
to dissolve some of the oxygen from the walls of the crucible (usually made 
of silica, SiO, which has a higher melting point than silicon). The solution 
to this problem is found in the floating zone method, which dispenses with 
the crucible altogether. A schematic of the arrangement used in the floating 
zone method is shown in Fig. 28-6. 

A polycrystalline rod of the crystal to be grown is held between two 
support posts inside a furnace filled with an inert gas. A small seed crystal 
is placed between the lower support post and the rod. A small molten zone 
is created at the end of the rod in contact with the seed by means of a movable 
external heating coil. As the coil is slowly raised, the molten zone moves 
upward, while the crystal solidifying behind it grows onto the seed. The 
molten zone is held between the unmelted ends by surface tension. The 
floating zone technique is also used for zone refining. 


28.3d Vapor-Phase Epitaxy 


A crystal-growing method that plays an important role in the fabrication of 
ICs is vapor-phase epitaxy. Very often the whole integrated circuit is made on 
a layer of Si 10 or 20 wm thick that is grown epitaxially onto a Si substrate 
(wafer). The word “epitaxial” originates from the Greek word meaning “ar- 
ranged on.” In this method, atoms of Si from a vapor are deposited on the 
substrate in a layer that has the same crystal structure and orientation as the 
substrate. Thus, the substrate serves as the seed crystal onto which the ep- 
itaxial layer grows. This first layer in turn serves as the substrate for the 
second layer, and so on. A schematic of the apparatus used for vapor epitaxy 
is shown in Fig. 28-7. 
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Schematic diagram of the Bridg- 
man-Stockbarger method for 
growing single crystals. 
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Schematic diagram of the float- 
ing zone method for single-crys- 
tal growth. 
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FIGURE 28-7 
Arrangement for growing Si lay- 


ers on silicon wafers by epitaxial 
growth. 
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Single crystal wafers of Si are placed in a heated chamber called the 
“reactor.” Gaseous compounds of silicon (for example, silicon tetrachloride, 
SiCL,), together with the appropriate reactant gas, are introduced into the 
reactor chamber. The temperature of the reactor is adjusted to produce the 
reaction that will liberate the silicon by decomposition of the compound. Thus, 
for example, at 1250°C the reaction 


SiCl, + H,— Si + HCl 


occurs. Some of the Si atoms released in the reaction are deposited onto the 
silicon substrates, thereby forming epitaxial layers. If the chemicals used for 
the reaction are of high purity, the epitaxial layer of Si will be highly pure. 
Alternatively, the layer can be deliberately doped, making it either p-type or 
n-type, by passing the hydrogen through a solution containing boron or 
phosphorous atoms (for example, boron trichloride or phosphorous trichlor- 
ide), before it is introduced into the reactor. 


28.4 THE PROCESSES OF IC PRODUCTION 


The processes involved in the fabrication of integrated circuits include epitaxial 
growth, oxidation, oxide removal and pattern definition, doping (introduction 
of selective impurities in the Si), and interconnection of components. Epitaxial 
growth has been discussed in Section 28.3d. We will now explain the addi- 
tional steps in the manufacturing of integrated circuits. 


28.4a Oxidation 


A key step in the production of an IC is the formation of a silicon dioxide 
(SiO2) layer on the surface of the silicon. This oxide layer permits the opening 
of windows on the silicon surface by the method described in the next section. 
The oxide layer also protects p-n junctions from contamination. Finally, be- 
cause SiQ, is nearly an insulator, the oxide allows the interconnection of the 
circuit components by means of thin aluminum strips without short-circuiting 
sections of the IC. 

The oxide layer is grown by heating the silicon wafer to temperatures 
ranging between 1000°C and 1200°C in an atmosphere of either pure oxygen 
or steam. The thickness of the oxide layer depends on the oxidation time and 
the temperature and the composition of the atmosphere in which the oxidation 
is performed. By careful selection of these three parameters, the exact thick- 
ness of the layer can be controlled. A layer 0.1 xm thick can be grown in one 
hour, at T = 1000°C, in pure oxygen. In the same time, a layer 0.5 wm thick 
grows in a steam environment. 


28.4b Pattern Definition 


To make an integrated circuit, a method of creating accurate patterns on the 
silicon wafer is needed. Photolithography (or masking) allows the removal of 
the silicon dioxide in the desired sections of the wafer. Once these “windows” 


Si wafers being unloaded from 
an oxide growth furnace. 
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UV radiation 


FPhotoresist 


Si wafer 


1. Oxide formation 2. Application of photoresist 3. Masking and exposure to 
ultraviolet radiation 


4. Removal of unexposed 5. Etching in HF acid: 6. Removal of exposed 
photoresist resulting in formation of photoresist 
‘windows’ in the SiQ> layer 


FIGURE 28-8 
Steps involved in the masking process (photolithography) used to nepen “windows” 


selectively on the silicon wafer. 


have been opened, diffusion of dopants or deposition of metallic contacts can 
be performed. 

The process of photolithography is carried out as follows. The oxidized 
wafer is coated with a thin layer of a photosensitive material called photoresist. 
This is done by placing a drop of a solution containing the photoresist on the 
wafer. A thin film is then formed by spinning the wafer very rapidly. Finally, 
the wafer is heated to speed up the evaporation of the solvent and to enhance 
the adhesion of the photoresist film to the oxide layer. A mask with the desired 
pattern is then placed on the photoresist film. The wafer is exposed to ultra- 
violet light that changes the structure of the exposed photoresist so that the 
exposed and unexposed parts have different solubilities in certain chemical 
solutions. Thus, exposure to the ultraviolet light, followed by development 
in the appropriate chemical solution, allows the removal of the unexposed 
section of the photoresist. The sections of the silicon dioxide layer not pro- 
tected by the photoresist are then etched away in a solution of hydrofluoric 
acid (HF), which selectively attacks the SiO, while leaving the photoresist 
and the silicon intact. After the window pattern has been opened in the SiO,, 
the remaining photoresist is washed away with the appropriate solvent. The 
wafer is now ready for the introduction of the dopants or for the evaporation 
of metallic contacts. The various steps involved in the formation of these 
“windows” are illustrated in Fig. 28-8. 


28.4c Doping 


An integrated circuit has the colloquial name chip, which we will now begin 
to use. The formation of circuit components in a chip is achieved by the 
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selective introduction of donor and acceptor impurities into the Si wafer to 
create localized n-type and p-type regions. The two most commonly used 
techniques are diffusion and ion implantation. 


Diffusion 


When Si is heated to temperatures in the neighborhood of 1000°C, some of 
the semiconductor atoms move out of their lattice sites, leaving behind empty 
lattice sites that can migrate through the sample. If the heating is done in an 
atmosphere of either phosphorous or boron atoms, these impurity atoms 
move into the vacant lattice sites at the surface of the silicon and subsequently 
migrate slowly into the bulk of the Si with the assistance of the vacant lattice 
sites formed at high temperature. The diffusion of the dopant impurities can 
be stopped by cooling down the wafer. Because this solid state diffusion of 
impurities is time and temperature dependent, the depth of the diffusion 
layer can be controlled by varying these two parameters. Phosphorous atoms 
can be diffused up to a depth of 1 um by heating the Si crystal to 1000°C for 
1h. Another important aspect of the diffusion of either boron or phosphorous 
impurities is that at the same temperature they move much more slowly in 
9iO; than in pure Si. Thus, the oxide pattern, formed by the photolithographic 
method described earlier, acts as a mask that permits the diffusion of the 
impurities only in specific regions of the wafer. 

Figure 28-9 shows a schematic of a typical diffusion furnace ‘ised in the 
fabrication of Si chips. The dopants to be diffused into the silicon wafers are 
introduced into the heated furnace by means of an inert carrier gas that is 
bubbled through the molten dopant (B, P, or such). Alternatively, gaseous 
compounds of B or P, such as diborane (B2Hg) or phosphorous trifluride (PFs), 
can be introduced directly into the furnace. 


lon Implantation 


An alternative method for introducing impurities into the semiconductor is 
ion implantation. After being ionized, atoms of the required dopant are accel- 
erated in a vacuum through a potential difference of several thousand volts. 
The accelerated ions are directed onto the masked silicon substrate (see Fig. 


28-10). As the impurity ions enter the Si, they make multiple collisions with 


the lattice ions and eventually come to rest. Because these collisions are ran- 
dom processes, not all the ions penetrate the same depth into the semicon- 
ductor. In Fig. 28-11 we show a typical distribution of the dopant atoms as 


FIGURE 28-9 

Schematic diagram of the appa- 
ratus used for the introduction of 
impurities into the unmasked 
sections of the silicon wafer by 
diffusion. 
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FIGURE 28-10 
Schematic of the ion implanta- 
tion method for doping the un- 
masked sections of the silicon 
wafer. 
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a function of the distance from the surface of the Si wafer. The distribution 
is approximately gaussian about an average penetration d. 

The average penetration d depends on the type of impurity and on the 
energy (the voltage through which the ions are accelerated) with which they 
strike the surface of the Si. This latter fact can be exploited to produce a fairly 
uniform profile of impurities. Figure 28-12 shows the result of varying the 
accelerating voltage several times during the implantation process. Each of 
the gaussian distributions (dashed lines) corresponds to a different acceler- 
ating potential. The overall distribution (solid line) is obtained by summing 
up the individual gaussian distributions, and the sum, as can be seen from 
the graph, is relatively flat. 

Ion implantation has many advantages over diffusion. The process is 
performed at room temperature. This permits the implantation of doped layers 
without disturbing previously diffused or implanted layers. Because the im- 
purities are ionized, they represent a current that can be measured very 
accurately. This permits accurate control of the impurity concentration. Ion 
implantation can be used with impurities that do not diffuse easily in Si. The 
relatively recent use of arsenic as a dopant in MOS (metal-oxide-semicon- 
ductors) devices is due to the advent of ion implantation. 

One problem encountered with ion implantation is that the very energetic 
ions cause considerable lattice damage as a result of the collisions. This dif- 
ficulty is corrected by annealing, that is, heating the wafer moderately to 
about 800°C, after implantation. 


> 


Dopant concentration 


Distance from surface of wafer 


28.4d Connection of Components in a Chip 


An IC chip consists of many superimposed doped layers. To complete the 
circuit, the electronic components within a layer as well as the layers them- 
selves must be electrically connected. This can be done either by forming (by 
diffusion or ion implantation) heavily doped (and therefore conductive) re- 
gions of silicon or by metal electrodes. This latter method is performed by 
metallic thin film evaporation. A schematic of the setup used for the evaporation 
of thin metallic films is shown in Fig. 28-13. A charge of the metal to be 
evaporated (usually aluminum or gold) is placed in a crucible. The silicon 
wafers are placed above the crucible in a device called the planetary. Vapor 
of the metal is produced by heating the crucible with a heater coil wrapped 
around it, or by electron bombardment. As the metal vapor hits the cooled, 
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FIGURE 28-11 

Profile of impurities introduced 
by the ion implantation method. 
The distribution of impurities is 
approximately a gaussian curve 
centered about some average 
penetration d from the surface of 
the wafer. 


FIGURE 28-12 

Profile of impurities obtained by 
varying the accelerating voltage 
four times during the implanta- 
tion process. Each of the four 
gaussian curves (dashed lines) 
represents the distribution of im- 
purities for a particular value of 
the accelerating voltage. The solid 
line, which gives the total distri- 
bution of impurities, is the sum 
of the four individual gaussians. 
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masked wafer, it condenses on it, thus forming a thin metallic layer in a 
desired pattern that connects the different sections of the IC chip. The whole 
evaporation process is performed in a vacuum to avoid contamination of the 
metal vapor with the oxygen in the air. 


28.5 ELECTRONIC COMPONENT 
FABRICATION ON A CHIP 


In this section we will illustrate how the techniques just described can be 
used to fabricate the basic components of a circuit: transistors, diodes, resis- 
tors, and capacitors. For simplicity, we will discuss the formation of each 
component separately. In the actual fabrication of an IC, parts of several 
components are formed simultaneously. This will be illustrated at the end of 
this section, where we discuss how a simple gate can be made. 


28.5a Transistors and Diodes 


The steps needed to build a bipolar transistor on a Si wafer are illustrated in 
Fig. 28-14. 


1. An epitaxial (called an epilayer) n-type layer is grown onto a p-type wafer 
(Fig. 28-14a). Part of this layer will serve as the collector. 


2. The n-type epilayer is then oxidized, masked, exposed to UV light, and 
so on, resulting in the formation of a window in the oxide layer (Fig. 
28-14D). 

3. Acceptor-type impurities are diffused to convert part of the exposed 
n-layer to p-type (Fig. 28-14c). A part of this p-type island will be the base 
of the transistor. 


Current feed-throughs 


FIGURE 28-13 
Schematic representation of a 
vacuum chamber used for vapor 
deposition of metallic intercon- 
necting strips. 
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Steps involved in the fabrication of a transistor on a silicon chip. 


4. The wafer is again oxidized, a window is opened in the new oxide layer, 
and a diffusion of donor impurities is performed (Fig. 28-14d), reconverting 
part of the p-type island to n-type. The latter region is the emitter of the 
transistor. 


5. In the final step the wafer is reoxidized and three windows are opened: 
one into the collector, one into the base, and one into the emitter. Alu- 
minum is evaporated to connect the three elements of the transistor (emit- 
ter, base, and collector) to other components of the circuit (Fig. 28-14e). 


The steps needed to build a diode are identical to those used in the 
fabrication of a transistor except that the last diffusion of donor impurities 
(step 4) is omitted. 


28.5b Resistors 


An integrated circuit resistor can be made by the shallow diffusion of a 
p-type channel into an n-type region or vice versa. The current is forced to 
flow through the channel by maintaining the channel at a negative voltage 
with respect to the surrounding n-type region. The resistance of the channel 
will be determined by its length, its cross section, as well as the doping 
concentration. Because of its relatively high conductivity, Si is not a useful 
material for a resistor. As a result, it is difficult to obtain large resistance 
values in ICs without using too much space in the chip. In cases where large 
resistors are needed, one standard approach is direct substitution. As we saw 
in Chapter 26, a transistor used in the common emitter configuration can be 
considered as a base current-controlled resistor. Thus, a transistor can be 
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introduced in a circuit where a resistor is needed. The effective resistance 
between the collector and the emitter will be determined by the base current. 
Circuit designers often introduce transistors in a circuit where resistors might 
have been employed to save space on the chip. 


28.5c Capacitors 
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A capacitor is essentially two conducting electrodes separated by a very thin Figure 29-15 


insulator. The first electrode of the microelectronic capacitor is usually made 
by doping very heavily (thus making it highly conductive) a region of the 
epitaxial layer. This region is covered with a SiO, layer as the insulator and 
the second electrode is formed by evaporating a conducting aluminum film 
on the oxide layer. A schematic of an integrated circuit capacitor is shown in 
Fig. 28-15. | | 


28.5d Fabrication of a Simple IC 


As we mentioned earlier, in the fabrication of an IC parts of several circuit 
components are often formed simultaneously. We will show how the simple 
NOR gate of Fig. 28-16 can be made (see Problem 27.3). 

The first step consists in growing three n-type islands on a p-type sub- 
strate. One of these islands will serve to construct the transistor, another the 
diodes, and the third the three resistors. These three islands are isolated 
electrically from one another by maintaining them at a positive potential with 
respect to the p-type substrate; that is, the substrate-islands junctions are 
reverse biased. To form these three islands, an n-type epilayer is grown onto 
the p substrate (Fig. 28-172). The epilayer is then oxidized and coated with 
photoresist (Fig. 28-17b). The wafer is exposed to UV radiation through the 
mask shown in Fig. 28-17c. After developing the photoresist (removing the 
unexposed parts) and etching the wafer with HF (removing the unprotected 
SiO), the wafer has the form that is illustrated schematically in Fig. 28-17d. 
The exposed photoresist is dissolved away, and acceptor type impurities are 
diffused into the n-type epilayer. The diffusion is allowed to continue until 
the entire epilayer is converted to p-type, except for the three regions protected 
by the SiO, coating. A top view of the wafer at this stage is shown in Fig. 
28-17e. The wafer is now reoxidized. Using the mask shown in Fig. 28-17f 
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FIGURE 28-16 
Electronic diagram of a DTL NOR 


gate. 
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FIGURE 28-17 
Steps involved in the fabrication of the NOR gate of Fig. 28-16 in integrated form. 
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and following the procedure outlined earlier for window opening in the oxide 
layer (photoresist coating, masking, UV radiation, and so on), windows with 
the shape of the shaded areas of mask (f) are opened in the n-type islands. 
Acceptor-type impurities are diffused through these windows into the three 
n-type islands. In this case, the p-type dopant is not allowed to penetrate all 
the way through the n-type layers. When this step is completed, there are 
three resistors in the upper island, the collector and the base of an npn tran- 
sistor in the lower left island, and two diodes with their n side common in 
the lower right island. The state of the chip is shown in Fig. 28-179. The wafer 
is oxidized again, and, using the mask shown in Fig. 28-17h, a window is 
opened onto the base of the transistor for the diffusion of n-type dopants to 
form the emitter. 

All the circuit components are now in place, and they must now be 
interconnected. To achieve this, the wafer is reoxidized again, and, with the 
mask shown in Fig. 28-171, small openings are made in the ends of the re- 
sistors, the three elements of the transistors, and both sides of the diodes. 
The wafer is then covered with the mask of Fig. 28-17j, and an aluminum 
evaporation is performed. The final structure is shown in Fig. 28-17k. The 
circuit is redrawn, using electronic symbols, in Fig. 28-171. 


28.6 CONCLUSION 


In this chapter, we have reviewed the technological processes used in the 
manufacture of integrated circuits. All these techniques became available by 
1960, when the era of integrated circuits began. Since then, the progress has 
been truly amazing. A modern-day integrated circuit 5 mm square contains 
more electronic components than the most complex piece of electronic equip- 
ment built before 1950. The $100 hand-held calculator has more computing 
capability than the ENIAC, the first large electronic computer. It is 20 times 
faster and thousands of times more reliable, has a larger memory, occupies 
1/30,000 the volume, consumes the power of a hearing aid rather than that 
of an electric heating system, and its price tag is 1/100,000 that of the ENIAC. 

The main reason for this rapid development has been the steady increase 
in component density, that is, the implementation of more and more compo- 
nents in a given area of the silicon wafer. The evolution of component density 
is illustrated in Fig. 28-18, known as Moore’s curve (Dr. Gordon Moore pre- 
dicted this evolution in 1964, and his prediction has been proven true up to 
now). Moore’s curve simply states that the component cen on a chip 
quadruples every two years. 

The increase in component density has been achieved by reducing the 
dimensions of the circuit elements. This has been made possible by improving 
the resolution of the photolithographic process. The line widths of the masks 
used to expose the photoresist to the UV radiation is limited, because of 
diffraction, to a few wavelengths. With ultraviolet radiation, this is of the 
order of a micrometer, a limit that has now been reached. In the search for 
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A silicon wafer on which over 
one hundred identical integrated 
circuits have been simulta- 
neously fabricated. The wafer is 
subsequently scribed and broken 
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tions, the chip is encapsulated in 
a protective plastic casing. 
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further reduction of the component size, methods are being developed in 
which ultraviolet radiation is being replaced by X-rays and electron beams. 
We may reasonably expect that component density of Si chips will continue 
to increase, although probably not as rapidly as Moore’s curve predicts. Cur- 
rent research on organic molecules as circuit components may someday create 
a new era of component density. | 

Concommitant with the reduction in the size of the circuit elements, there 
has been a steady improvement in the performance of the device and a re- 
duction in production costs. Because delay times are directly proportional to 
the dimensions of the circuit, the circuit becomes faster as it becomes smaller. 
Moreover, as circuits become more integrated, the number of external inter- 
connections decreases, leading to an increase in the reliability of the device 
and a reduction in maintenance costs. Further reductions in cost are associated 
with the fact that power dissipation decreases as the size of the device de- 
creases. In addition to the energy savings, this leads to savings in such ac- 
cessories as power transformers, cooling fans, racks, and cabinets. 

Microelectronics has now gone through MSI (Medium-Scale Integration, 
a few hundred components per chip), LSI (Large-Scale Integration, a few 
thousand components per chip) and VLSI (Very-Large-Scale Integration). It 
is rapidly progressing into the SLSI (Super Large-Scale Integration with several 
hundred thousand components per chip). We are now technically able to 
implement a complete computer on a single chip selling for a few dollars. 
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FIGURE 28-18 
Moore’s curve showing that since 
1964 the number of circuit com- 
ponents on a chip has quadru- 
pled every two years. 


The BELLMAC-4 a one chip 
computer developed for tele- 
communications is compared to 
a standard size paper clip. 
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Thermodynamics, first law of, 124 Voltage-current characteristics 
Thermometers, 113 of diode, 461, 464 
Thompson, J. J., 264 of JFET, 475 
Tight-binding approximation, 407 of MOSFET, 476 
Time, 38 of transistor, 471 
collision, 367 Voltage difference, 195 
constant in RC circuit, 223 Voltage drop, 210 
unit of, 3 Voltmeter, 210, 219 
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on current loop, 230 Water molecule, 357 
Transistor, 465 Watt, 62, 221 
base, 465 Wave front, 162 
bipolar junction (BJT), 465 Wavefunction, 285 
collector, 465 of hydrogen, 333 
common base configuration, 465 required properties of, 309 
common emitter configuration, 471 significance of, 285, 302 
current amplifier, 473 Wavelength, 147 
current gain parameter, 472 Wave number, 148 
emitter, 465 Wave packet, 290 
field effect (FET), 473 group velocity, 292 
inverter, 492 Wave-particle duality, 293 
junction field-effect (JFET), 474 Wave(s), 146, 147 
load line, 492 amplitude of, 146 
metal-oxide-semiconductor (MOSFET), 476 electromagnetic, 237 
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unipolar, 474 matter, 280, 283, 289 
voltage amplifier, 469 intensity of, 153 
Traveling wave, 147 standing, 173, 315 
Truth table, 482 traveling, 147 
for AND operation, 483 velocity of, 146, 147 
for OR operation, 483 wavelength of, 146 
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Uncertainty principle, 285 Well, infinite potential, 311, 370 
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Valence band, 416 Wiedemann-Franz law, 387 
Valence electrons, 344, 363, 370 Wien’s displacement law, 260 
Van der Waals bond, 356 Work, 54, 123 
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distribution of electrons, 382 

group, 292, 419 
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X-rays, 253 
characteristic, 254 
crystal structure and, 348, 438 
cut-off wavelength, 254 
diffraction of, 170 
production of, 253 
spectrum, 254 

Xerography, 449 


tangential, 83 Young, T., 163 . 

wave, 147 Young’s double slit experiment, 162 
VLSI, 517 
Volt, 195 Zeeman, P., 326 
Volta, A., 194 ‘Zeeman effect, 326 
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1820 
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1823-71 


1847-54 


1850s 


1873 


1885 


1900 


1905 


Milestones in the Development of the Concepts Presented in this Book 


Galileo Galilei publishes his Dialoge in which he 
presents his ideas on the motion of the solar 
system. 


Robert Boyle enunciates the pressure-volume law 
of gases. 


The work of René Descartes on his concepts of 
physical science, including momentum, is pub- 
lished posthumously in Amsterdam. 


Isaac Newton publishes his Principia enunciat- 
ing three laws of motion and the law of universal 
gravitation. 


Charles Coulomb discovers the law of force be- 
tween charges. 


Henry Cavendish measures the gravitational 
constant. 


Thomas Young demonstrates the phenomenon 
of interference of light, thus establishing its wave 
nature. 


Amadeo Avogadro proposes that gram-mole- 
cular weights of substances have the same num- 
ber of atoms or molecules. 


Hans Oersted discovers electromagnetism. 


Georg Ohm states the relationship between 
voltage, current, and resistance. 


Charles Babbage designs the first programmable 
digital calculator with memory. Although it was 
never built, he is recognized as the father of the 
modern computer. 


George Boole pioneers symbolic logic and de- 
velops boolean algebra. 


Julius Mayer, James Joule, Hermann von Helm- 
holtz, and L. A. Colding independently estab- 
lish the equivalence of heat and energy. 


James Maxwell synthesizes the laws of elec- 
tromagnetism and predicts the existence of elec- 
tromagnetic waves. 


Heinrich Hertz verifies experimentally Max- 
well’s theorv of the existence of electromagnetic 
waves. 

Max Planck introduces quantum theory to ex- 
plain blackbody radiation. 

Albert Einstein proposes the quantization of 
electromagnetic radiation (photons) to explain 
the photoelectric effect. 


1909 


1911 


1913 


1914 


1921 


1923 


1924 


1925 


1925 


1926 


1927 


1927 


1927 


1928 


1939 


1946 


1948 


1958 


1976 


Robert Millikan performs oil drop experiments 
that determine the charge of the electron. 


To explain the alpha particle scattering experi- 
ments of Hans Geiger and Ernest Madden, Er- 
nest Rutherford proposes the nuclear planetary 
model of the atom. 


Neils Bohr proposes the quantum mechanical 
model of the hydrogen atom. 


James Franck and Gustav Hertz verify the quan- 
tization of the internal energy of atoms. 


Otto Stern and Walter Gerlach discover space 
quantization. 


The increase in wavelength of scattered X rays 
is explained by Arthur Compton with the pho- 
ton theory. 


Louis de Broglie proposes the wave nature of 
particles. | 


Wolfgang Pauli enunciates the exclusion prin- 
ciple. 

To explain the fine structure in the spectrum of 
atoms, Samuel Goudsmit and George Uhlen- 
beck introduce the concept of the electron spin. 


Erwin Schrédinger presents his theory of quan- 
tum mechanics. 


Max Born suggests the probabilistic interpreta- 
tion of the wavefunction. 

Werner Heisenberg presents the Uncertainty 
Principle. 

Clinton Davisson and Lester Germer verify the 
wave nature of electrons. 


Arnold Sommerfeld presents the quantum me- 
chanical free electron theory of solids. 


A. H. Wilson develops the general model of 
semiconductors. 


Johann von Neumann develops the concepts of 
the modern computer mainframe. 

John Bardeen, Walter Brattain, and William 
Shockley construct the first transistor. 

The first integrated circuit on a silicon chip is 
developed at Texas Instrument and Fairchild 
Semiconductor. 


Intel announces the production of an eight-bit 
computer on a single silicon chip. 
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Atomic and Semiconductor Data 


Electronic charge 

Mass of the electron 

Mass of the proton 

Mass of the neutron 

Bohr radius 

Ionization energy of hydrogen 
Effective mass of electrons in silicon 
Effective mass of holes in silicon 
Energy gap (E,) in silicon 

Effective mass of electrons in germanium 
Effective mass of holes in germanium 
Energy gap (E,) in germanium 


Conversion Factors 


leV = 1.6 x 10°'"J 
1A =107-%m 

1 calorie = 4.19 J 

1u = 1.66 x 10°?’ kg 


1.6 x 10°"%C 

9.11 x 10-*' kg 

1.67 X 107?” kg 

1.67 X 10~?” kg 

5.3 x 1071!m 

13.6 eV 

0.31 x 9.11 x 1073! kg 
0.38 x 9.11 x 107%! kg 
1.1 eV 

0.12 x 9.11 x 1073! kg 
0.23 x 9.11 x 107%! kg 
0.67 eV 


Some General Constants 


Avogadro’s number 
Boltzmann constant 


Coulomb constant 
Gravitational constant 
Permittivity of free space 
Planck constant 


Speed of light 
Universal gas constant 


Mathematical Symbols 

# is not equal to 

~ is approximately equal to 
~ is of the order of 

« is proportional to 

> is greater than 

= is greater than or equal to 
< is less than 

> is much greater than 

< is much less than 


Na = 6.02 Xx 1073 molecules/mole 


kg = 1.38 x 10°33 J/K 

= 8.63 x 10°>> eV/K 
1/47e9 = 8.99 x 10° N-m2/C2 
G = 6.67 x 107"? N-m*/kg? 
Eo = 8.85 x 107! C2/N-m?2 
h = 6.63 x 10 ~* J-sec 

= 4.14 x 107 eV-sec 
3.00 x 10° m/sec 
R = 8.31 J/mole-K 
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